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TITLE:  Doing  C?  (Comma-id  Control)  Experiments  Using  War  Games. 

A  Comparison  of  Two  Air  Defense  Command  and  Control 
Model. 0. 

Lancaster's  Equations  and  Game  Theory. 

Information  Processing  and  Decisionmaking  Organizations: 

A  Mathematical  Description. 

A  Model  for  Asynchronous  Distributed  Computation. 

A  Unified  Approach  to  Modeling  end  Combining  c.f 
Evidence  through  Random  Set  Theory. 

Software  for  Explicitly  Probabilistic  Mathematics. 

Command  Decision  Makers  and  Their  Modes  of  Interaction. 
Information  Theoretic  Models  of  Memory  in  Human 
Decisionmaking  Models. 

Human  Deci sionmaking  in  Dynamic  Environments  with 
Increasing  Information  Processing  Demands. 

Individual  Differences  in  Military  Decisionmaking: 

The  Classification  Performance  of  Active  Sonar  Operators. 
Decision  and  Display  Analysis  in  a  Simple  Surveillance 
Problem. 

The  Effects  of  a  Spatial  Information  Format  on  Decision 
Making  Performance  in  a  C3  (Command,  Control,  and 
Comiminications)  Probablistic  Information  Integration 
Task. 

Development  of  a  Generalized  Human-Machine  Interface. 
Tactical  Decision  Aids  Which  Quantify  Judgment. 

A  Knowledge  Based  Interactive  Procedure  for  Planning 
and  Decision  Support  under  Uncertainty  and  Parameter 
Imprecision. 

An  Evaluation  of  ARIADNE  (Alternative  Ranking  Interactive 
Aid  Based  on  Dominance  Structure  Information  Elicitation). 
Research  on  Cognitive  Collaboration  between  Persons 
and  Computers. 

Coordination  of  Battle  Group  Warfare  Commanders  through 
Summary  Display  Transfers. 

A  Man-Machine  Interface  Concept  for  a  State-of-Art,  ~  . 
Shinboard,  Command/Control  Console.  ■ 


F011 WORD 


The  Sixth  MIT/011  Jt?  Workshop  was  hold  from  July  25,  19S3  through  July 

n 

29,  1983.  SsstioDs  on  ths  first  four  days  wars  hold  at  tha  Massaebusatts 
Isstituta  of  Tachnology,  Caabridga,  Massachusetts,  sad  these  Proceedings 
constitute  the  written  record  of  the  work  presented  there.  The  fifth  day's 
session  was  held  at  The  MITIZ  Corporation,  Bedford,  Massachusetts,  and  was 
organised  separately  by  the  Office  of  Kaeal  Rsaarch.  As  the  topics  under 
discussion  there  were  classified,  the  corresponding  papers  haee  not  been 
included  in  tnis  volume. 

The  Workshop  attracted  approximately  160  persons;  its  greatest  strength 
vaa  the  variety  of  backgrounds  represented  by  these  participants.  Leaders 
from  government  laboratories  and  funding  agencies,  managers  and  technical 
staff  of  high  technology  firms,  and  faculty,  staff,  and  students  from 
academic  institutions,  all  came  together  to  contribute  to  the  teady 
evolution  of  C*  from  an  art  to  a  science. 

Several  themes  continued  to  appear  throughout  this  Workshops  themes 
which  can  be  traced  back  through  the  preceding  ones.  These  are  reflected  in 
the  section  headings  used  in  the  organisation  of  tha  proceedings.  Tha  most 
challenging  problem  facing  scientists  is  that  of  devising  means  to 
evaluate  tha  performance  of  alternative  cj  system  architectures.  This  muat 
take  into  account  the  sensors,  communications,  and  weapons  available  to 
friendly  as  well  as  enemy  forces.  It  must  involve  a  closed-loop,  cybernetic 
model  of  tha  entire  military  situation  and  its  must  inevitably  involve  a 
deep  understanding  of  the  behavior  of  the  Romans  which  execute  the  key 
decision  functions  from  sonar  operators  through  battle  group  commanders. 
From  the  perspective  of  design,  ^surveillance  has  been  reduced  to  a 
relatively  precise  science,  at  least  compsred  with  the  design  of  the 
decision  function. 

Tha  purpose  of  these  Workshops  is  to  foster  real-time  interaction 
between  many  of  tha  key  contributors  in  the  developing  science  of  C^.  In 
this  respect,  the  Sixth  Workshop  was  a  significant  success.  We  look  forward 
to  the  Seventh  Workshop,  to  be  held  from  June  11  to  June  13,  1986  in  sunny 
San  Diego,  California,  to  be  equally  successful. 


Robert  1.  Tenney 
Cambridge,  Massachusetts 
December  1983 
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SECTION  I 

Evaluation:  Methods  and  Tools 


DOING  C2  EXPERIMENTS  USING  WAR  GAMES 


Or.  Joel  S.  Lawson,  Jr. 

Technical  Director,  C^I  Systems  A  Technology  Directorate  (ELEX  06T) 


Naval  Electronic  Systems  Command 
Washington,  OC  20363 


INTRODUCTION 

In  the  last  few  years  considerable  progress  has  been  made  In  the  development  of  an  analytic  theory  of 
military  Conaand  Control  (C2),  both  as  a  process  and  as  a  large-scale  system.  There  are  now  models  of  C2 
organizations  which  permit  the  examination  of  the  effects  of  various  changes  In  a  C2  system  and  which  can 
predict  some  of  the  behavior  of  such  a  system  In  a  gross  sense.  And,  due  to  the  Increased  attention  being 
given  the  field,  with  the  attendant  Increase  In  papers,  workshops,  etcetera,  there  Is  slowly  developing  a 
coenon  vocabulary  for  use  In  the  emerging  *C2  Theory." 

i 

What  Is  still  lacking,  however.  Is  a  body  of  experimental  data  which  can  be  used  as  a  "touchstone*  to 
guide  further  theoretical  developments,  and  against  which  theoretical  predictions  can  be  tested. 

This  paper  reports  the  results  of  a  very  rudimentary  experiment  which  was  conducted  at  the  Naval  Post¬ 
graduate  School  at  Monterey,  California,  during  the  1983  Winter  Quarter  to  test  two  specific  hypotheses.  As  Is 
often  the  case.  It  was  found  necessary  to  modify  or  restate  the  hypotheses  during  the  conduct  of  the  exoerlment 
in  order  to  accommodate  certalnY'real  wor1d*"constra1nts.  The  results,  however,  are  both  Interesting  1  'heir 
own  right  and  reassuring  for  the  prospect  of  being  able  to  do  further  experiments  In  the  C2  arena. 


Motivation  for  the  Experiments 


The  practical  motivation  for  this  experiment  lay  In  a 
desire  to  study  the  "decision  making"  process  In  a  C2 
process.  "Decisions"  are  one  of  the  two  major 
products  of  a  C2  system.  (The  other  major  C2  system 
product  may  be  considered  to  be  "plans"  which  are 
statements  of  actions  to  be  taken  by  vai  ;ous  parts  of 
the  system.  They  are  generated  either  to  implement  a 
previous  decision  or  to  provide  for  possible  future 
decisions.  In  many  cases  they  have  the  nature  or 
"pne-determlned  decisions.")  It  was  this  motivation 
which  led  to  the  approach  and  the  experimental 
procedure  adopted. 


In  keeping  with  a  “hard  science"  approach  to  the 
development  of  a  C2  Theory,  we  wanted  to  find 
"observables"  which  could  be  observed  by  people  who 
were  not  part  of  the  system  and  which  could  also  be 
observed  by  other  Investigators,  In  other  experiments. 


It  should  be  noted  that,  by  the  nature  of  the  C2 
business,  which  Involves  both  people  and  complex 
scenarios.  It  Is  nearly  Impossible  to  "replicate"  cn 
experiment.  People,  unfortunately,  learn,  and  their 
behavior  on  a  second  trial  will  be  different  from 
what  It  was  on  the  first.  Also,  different  ‘‘cotnrand 
teams"  or  battle  staffs  will  have  different 
approaches  or  "styles"  In  dealing  with  the  same  basic 
problem.  Furthermore,  In  "free  play"  exercises, 
which  are  C,e  only  ones  In  which  real  decision  making 
take,  place,  the  dynamics  of  the  entitles  In  the 
scenario  very  seldom  repeat.  In  particular,  succes¬ 
sive  plays  of  the  same  scenario  generally  do  not  lead 
to  the  same  physical  configuration  of  the  battlefield 
after  even  a  few  minutes  of  play.  Therefore, 
"replication,"  In  the  sense  it  Is  used  In  physic-  or 
chemistry.  Is  a  practical  impossibility.  Rather,  we 
must  search  for  observables  which  are  In  some  way 
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connected  to  the  decision-making  process  and  which 
can  be  observed  In  numerous  settings,  but  which  are 
independent  of  the  details  of  the  scenario  and  the 
particular  participants. 

Pursuing  this  line  of  reasoning.  It  was  decided  that 
an  interesting  characteristic  quantity  might  be  the 
time  Interval  between  decisions.  Independent  of  what 
the  decisions  were.  A  histogram  of  this  Interval 
against  the  number  of  case;  In  which  It  was  observed 
shoula  gi t-iT-eng—jr.me  idea  about , the  tempo  of  activity 
In  a  command  center. 

But  this  leads  to  the  question  of  defining  "a 
decision"  and  how  It  Is  to  be  observed.  At  first  It 
was  felt  that  decisions  could  be  grouped  into  broad 
classes,  such  as  tactical,  strategic,  or  logistics, 
and  observed  or  logged  by  listening  to  the  conversa¬ 
tions  between  members  of  the  battle  staff.  In 
practice,  this  turned  out  to  be  virtually  Impossible, 
and  a  simpler  but  more  exact  definition  of  a  decision 
was  adopted. 

Specifically,  decisions  were  defined  as  space-time 
events  made  marl  rest  by  either  utterances  or  a  series 
of  keystrokes  on  a  computer  terminal  which  request  or 
transmit  Information  or  directions.  This  definition 
satisfies  the  rew! r-^ment  that  the  decision  be 
observable,  I.*.,  i«  Is  given  physical  reality.  (My 
decision  to  hr  •  *eak  for  dinner  tomorrow  has  no 
reality  until  I  mi  it  known  to  the  outside  world  by 
telling  someone  *  •  This  choice  also  takes  us 
down  to  the  "on-  •  .  v;  level"  In  the  decision-making 
process.  A  d.c  ■  co  ask  rot  information,  or  to 
send  a  message,  is  .  w  treated  equally  with  a 
decision  to  engage  -he  enemy.  (Perhaps  a  better  name 
for  "decisions"  ce'nied  In  this  way  would  be  "trans¬ 
actions,"  but  ter  •  *  i  we  shall  continue  to  call  them 
decisions.)  Tbi  •  flnitlon  of  decision  led  directly 
to  a  revised  verrlon  of  the  original  first  hypothesis: 


A  CoMunder  or  Battle  Staff  produces  decisions 
which  are  evidenced  by  utterances  or  keystrokes 
and  which  can  be  observed  by  external 
observers.  Furthermore,  different  external 
observers  will  agree  on  the  "class"  to  which  the 
observed  decision  belongs  If  the  classes  arc 
chosen  In  a  reasonably  general  manner. 

The  notion  of  a  “class"  of  decisions  will  be 
described  further  below,  when  we  discuss  the 
experimental  procedure.  But  before  going  on  to  that. 
It  will  be  recalled  that  there  was  a  second 
hypothesis  to  be  tested  In  this  experiment.  The 
original  form  of  this  hypothesis  was  stated  as 
follows: 

The  histogram  of  time  Intervals  between 
decisions  should  give  an  Indication  of  the  state 
of  training  or  competency  of  the  Commander  or 
Battle  Staff. 

It  seemed  reasonable  to  assume  that  a  relatively 
untrained  team  would  take  almost  random  amounts  of 
time  to  arrive  at  each  of  a  series  of  decisions, 
leading  to  a  nearly  flat  histogram.  At  the  same 
time,  a  more  experienced  group  would  have  better 
defined  work  habits  and  evidence  a  more  peaked 
distribution.  In  fact,  one  might  expect  a  blmodal 
distribution,  with  simple  or  routine  decisions  being 
made  quickly  and  effortlessly,  while  more  complex 
procedures  took  longer.  Alternatively,  a  complei 
problem  might  result  In  a  long  time  delay,  followed 
by  a  flurry  of  activity  with  short  apparent 
Inter- transaction  times. 

Unfortunately  the  data  collected  In  this  experiment 
do  not  allow  for  a  simple  determination  of  which 
decisions  may  have  resulted  from  some  previous  one. 
Nor  Is  there  any  direct  way  to  associate  one  decision 
with  another  In  an  Input-output  sense.  Investigation 
of  these  matters  will  have  to  await  an  Improved 
experimental  procedure. 

The  Experimental  Setting 

An  opportunity  to  test  these  hypotheses  arose  during 
the  1983  Winter  Quarter  at  the  Naval  Postgraduate 
School,  Monterey,  California. 

A  series  of  computer-based  war  games  were  to  be  run, 
using  the  Warfare  Environment  Simulator  (WES)  at  the 
Naval  Ocean  Systems  Center  (NOSC),  San  Clego, 
California,  to  which  the  Postgraduate  School  has  a 
remote  access  capability.  The  purpose  of  these  games 
was  to  give  the  students  In  the  Operations  Research 
curriculum  an  opportunity  to  get  “hands-on" 
experience  with  the  operation  of  a  war  game.  Thus, 
the  subject  "Battle  Staffs"  were  certainly 
Inexperienced  and.  In  fact,  being  made  up  of  members 
of  all  four  services,  many  of  the  players  were 
unfamiliar  with  Navy  terminology  and  procedures. 

The  WES  facility  at  the  Postgraduate  School  at  that 
time  consisted  of  three  sets  of  displays,  one  each 
for  the  Orange  and  Blue  teams  and  one  for  the  umpire 
or  game  coordinator.  Each  set  of  displays  had  a 
large  graphic  display  of  the  theater  of  operations  In 
the  center  with  a  smaller  alphanumeric  display  and 
keyboard  on  each  side  of  It.  The  scale  of  the 
geographic  display  could  be  changed  to  provide  a 
zoom"  capability  and  appropriate  symbology  depicted 
t'»e  location  of  objects.  One  of  the  alphanumeric 
displays  was  generally  used  only  as  a  status  board, 
while  the  other  was  used  to  give  Instructions  to  the 
machine  to  effect  control  of  that  side's  forces. 


The  display  systems  In  the  Blue  and  Orange  command 
posts  showed  only  those  targets  which  had  been 
detected  by  their  own  sensor  systems,  while  the 
umpire's  position  had  access  to  the  data  as  seen  by 
each  side  as  well  as  to  ground  truth. 

The  class  running  the  war  game  was  split  Into  three 
groups  (A,  B,  and  C)  of  two  (Orange  and  Blue)  three- 
man  teams.  Each  was  a  familiarization  session  In 
irfilch  they  learned  how  to  operate  the  keyboard  and 
make  the  machine  do  some  elementary  things.  In  the 
second  session  they  actually  did  play  a  war  game, 
deploy  their  forces,  and  get  to  the  point  of  engaging 
the  enemy.  The  final  session  was  run  “for  the 
record"  and  was  based  on  a  scenario  and  Initial  force 
disposition  very  similar  to  the  ones  with  which  they 
had  practiced.  By  this  time  the  Battle  Staffs  seemed 
to  be  reasonably  comfortable  with  the  system  and 
could  concentrate  most  of  their  effort  on  the  battle 
rather  than  the  game  mechanics. 

To  carry  out  the  experiment  and  test  these  hypo¬ 
theses,  observers  were  recruited  from  other  students 
enrolled  in  the  Command  Control  and  Commmicatlons 
curriculum  at  the  Postgraduate  School,  and  arrange¬ 
ments  made  to  have  them  present  during  the  running  of 
the  games. 

Only  limited  observations  were  made  during  the  first 
series  of  runs,  to  test  the  methodology.  The  data 
reported  here  were  all  taken  during  the  second  and 
third  series  of  runs,  when  the  performance  of  the 
Battle  Teams  had  become  reasonably  stable. 

Experimental  Procedure 

For  the  experiment,  each  observer  was  equipped  with  a 
clipboard  of  data  sheets  wnlch  had  a  synchronized 
clock  mounted  on  It.  They  were  Instructed  to  record 
to  the  nearest  second  when  they  thought  they  observed 
a  "decision"  and  the  "class"  of  that  decision.  A 
sample  of  the  data  sheet  Is  reproduced  In  Figure  1, 
which  also  shews  the  classes  of  decisions  used  in 
this  experiment.  These  classes  were  Intended  to  be 
generic  in  nature  and  encompass  nearly  all  the  situa¬ 
tions  which  were  expected  to  be  observed. 

The  observer  was  also  asked,  if  ne  had  time,  to  enter 
any  amplifying  Information  In  the  "Notes"  column. 

The  columns  for  "Game  Time"  and  "Enter  Time"  were 
provided  to  allow  correlation  to  game  time  (which  can 
be  faster  or  slower  than  clock  time)  and  to  note  the 
time  when  an  Instruction  which  had  been  observed 
passing  from  one  team  member  to  another,  was  actually 
entered  on  the  keyboard.  The  latter  turned  out  to  be 
an  Impossible  task  as  the  keyboards  were  In  such 
constant  use  is  to  preclude  connecting  any  particular 
set  of  keystrokes  with  any  specific  utterance.  (In 
future  experiments  It  may  be  possible  to  achieve  some 
of  these  measurements  by  Installing  appropriate 
"hooks"  In  the  game  software.) 

After  the  experimental  session,  the  observations  were 
transcribed  Into  a  computer  program  which  computed 
the  desired  time  Intervals  and  could  generate  various 
histograms  of  the  results.  Figure  2  Is  a  sample  of  a 
data  file  generated  by  this  program. 

As  shown  In  the  Figure,  It  was  eventually  decided  to 
classify  the  decisions  by  "type"  as  well  as  "class." 
This  was  partly  due  to  the  confusion  between  dif¬ 
ferent  observers  as  to  which  class  a  particular 
decision  should  be  placed  In,  and  partly  because  of 
the  limited  number  of  observations  of  members  of  a 
particular  class.  By  Introducing  a  definition  of 
types"  of  decisions  as  either  "Information 
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decisions*  or  ‘action  decisions,*  the  sample  size  was 
effectively  Increased  and  the  differences  between 
observers  was  reduced  to  a  negligible  amount. 

Examples  of  this  distinction  are  decisions  which  deal 
with  ‘Information,*  such  as  requesting  status  or 
Identity,  and  those  decisions  which  deal  with 
‘action,*  such  as  deploying  a  force  or  changing  the 
EMCON  condition. 

With  these  additions  to  the  data  definitions,  the 
time  Intervals  are  defined  as  follows: 

TO  «  real  world  clock  time 

T1  *  time  since  last  decision.  Independent  of 
Its  class 

T2  ■  change  In  T1  since  last  T1  event 

T3  ■  time  since  last  decision  of  same  "type* 

T4  »  change  In  T3  since  last  T3  event 

T5  ■  time  since  last  decision  of  same  "class* 

T6  ■  change  In  T5  since  1st  T5  event 

By  generating  such  data  files  on  disk,  each  repre¬ 
senting  the  observations  of  one  observer  (using  the 
observer's  name  as  file  name)  during  a  game  he 
observed  (using  the  team  name  and  game  number  as  a 
file  extension),  the  data  was  set  up  to  allow  com¬ 
parison  of  different  observers  as  well  as  different 
teams  and  games. 

The  results  of  this  analysis  are  reported  in  the  next 
section. 

I  Experimental  Results 

Before  examining  the  specific  experimental  results,  a 
word  about  the  choice  of  "classes  of  transactions'  Is 
In  order  because  they  turned  out  to  Influence  the 
apparent  results.  In  particular,  the  distinction 
between  ordering  the  deployment  of  one  or  two  units 
or  platforms  and  ordering  the  deployment  (e.g.,  a 
course  change)  of  the  whole  force  was  Interpreted 
differently  by  different  observers.  And  the  same  was 
true  of  requests  for  the  status  of  Individual  units 
Or  of  a  major  portion  of  the  force.  Much  of  this 
Confusion  could  have  been  overcome  If  there  had  been 
more  time  to  test  the  methodology  and  have  the  data 
takers  agree  on  some  conventions.  As  It  was,  there 
was  a  minimum  amount  of  coordination  between  the 
Observers,  and  each  one  made  up  his  own  rules  as  to 
how  he  would  classify  the  events  he  observed. 

The  basic  classes  were  originally  chosen  because  It 
was  assumed  that  they  were  sufficiently  general  to 
encompass  all  the  transactions  which  were  likely  to 
take  place,  while  also  being  representative  of 
distinctly  different  sorts  of  activity  which  one 
would  expect  to  see  In  a  command  center. 

In  practice.  It  appecrs  that  these  were  reasonable 
assumptions,  but  that  the  definition  of  the  "classes 
of  transactions"  can  probably  be  Improved,  and  In 
fact  should  probably  be  tailored  to  match  the  nature 
of  the  scenario  and/or  level  of  command  which  Is 
being  examined.  Figures  3  and  4  present  a  typical 
comparison  of  the  time  between  transactions.  Inde¬ 
pendent  of  the  type  or  class  of  transaction,  as 
recorded  by  two  different  observers  watching  the  same 
battle  staff.  These  Figures  are  histograms  of  the 
time  Intervals  between  observed  decisions,  grouped 
Into  18-second  wide  bins.  (The  units  of  "width"  are 
seconds.)  For  ease  of  comparison,  the  actual  number 
of  observations  has  been  normalized  to  100  so  these 
represent  percentage  distributions. 

It  Is  Immediately  obvious  that  the  observers  were 
apparently  observing  the  same  sort  of  behavior  and 


recording  a  similar  series  of  events.  A  detailed 
comparison  of  their  event  logs  bears  this  out.  In 
fact.  In  all  cases  where  there  were  two  or  more 
observers  watching  the  same  battle  staff,  there  was 
remarkable  agreement  In  what  they  logged  as 
"decisions*  or  transactions. 

The  i.wo  major  sources  of  nonconformity  of  the  logs 
were  the  differences  of  Interpretation  mentioned 
above  and  a  very  simple  but  Important  physical  limi¬ 
tation.  Because  the  battle  staff  were  sitting 
side-by-side  In  front  of  the  displays,  the  observers 
tended  to  sit  on  either  side  of  them  so  they  could 
both  see  the  displays  and  hear  what  the  group  was 
talking  about.  This  led  to  one  observer  being  able 
to  hear  remarks  about  what  was  on  the  status  board 
better  than  the  other,  while  the  second  was  more 
alert  to  directions  to  the  keyboard  operator. 

Nonetheless,  as  a  general  rule  the  overlap  of  the 
logs  of  pairs  of  observers  was  about  60  percent,  even 
without  any  serious  attempt  to  agree  on  just  how  the 
classification  scheme  would  be  applied.  At  the 
grosser  level  of  whether  It  was  an  ‘action*  trans¬ 
action  or  one  dealing  with  information,  the  agreement 
was  nearly  80  percent.  The  "disagreements*  are 
nearly  all  accounted  for  by  a  missing  entry  on  one  of 
the  logs.  That  is,  one  observer  logged  an  event 
which  the  other  did  not.  In  most  cases  this  can  be 
explained  as  a  result  of  the  placement  of  the 
observer  relative  to  the  battle  staff,  as  mentioned 
above. 

Moreover,  data  taken  by  the  same  observer  watching 
different  Battle  Staffs  produces  plots  very  similar 
to  those  shown  In  Figures  3  and  4.  So  the  distribu¬ 
tion  of  time  Intervals  between  decisions  does  seem  to 
be  a  characteristic  quantity,  at  least  In  this 
setting. 

An  Interesting  change  Is  observed,  however,  when  the 
time  between  successive  action  or  Information  trans¬ 
actions  Is  plotted,  as  In  Figures  5  and  6.  The 
action  decisions  still  show  the  Rayleigh-like  distri¬ 
bution,  while  the  Information  decisions  are  more 
uniformly  distributed.  No  explanation  for  this  Is 
offered,  but  an  Interesting  comparison  with  some 
other  data  will  be  made  below.  (Also  It  was  noted 
that  some  observers  seemed  to  be  more  sensitive  to 
action  decisions  than  to  Information  decisions,  per¬ 
haps  reflecting  their  personality  more  than  their 
location  In  the  command  space.) 

Based  on  these  results.  It  seems  that  the  first  of 
the  hypotheses  to  be  tested  Is  confirmed  by  the  data 
trfilch  was  taken  during  this  experiment. 

As  far  as  the  second  hypothesis  Is  concerned,  we  can 
only  report  that  there  is  no  significant  difference 
between  the  data  obtained  In  the  second  and  third 
sets  of  war  games.  If  there  was  any  "learning 
effect,"  It  was  masked  by  other  attributes  of  the 
game. 

A  final  Interesting  observation  Is  that  In  essential¬ 
ly  all  cases  the  observers  logged  about  two-thirds  of 
the  transactions  as  action  decisions  and  only 
one-third  as  Information  decisions.  This  may  be 
partly  due  to  the  nature  of  the  HES  facility.  In 
which  the  Battle  Staffs  have  to  issue  a  rather  large 
number  of  maneuvering  Instructions  to  their 
(computer-simulated)  forces.  This  has  the  effect  of 
putting  the  Battle  Staff  In  a  much  more  "tactical" 
role  than  a  "command*  role.  Also,  with  only  one 
command  center  on  each  side,  there  were  no  superiors 
or  subordinates  to  answer  questions  for  or  receive 
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queries  from.  A  third  cause  may  ha*'e  been  the 
presence  of  ilectronlc  status  boards  milch  provided 
Information  without  the  verbal  cluer.  which  permitted 
the  transaction  to  be  recorded. 

Comparison  With  a  Manual  War  Game 

Coincident  with  the  conduct  of  this  experiment,  the 
"historical  logs*  of  a  game  run  at  the  war-gaming 
facility  at  the  Naval  War  College  at  Newport,  Rhode 
Island,  during  the  preceding  fall  became  available. 

This  game  also  involved  some  Inexperienced  players 
but  these  were  supported  by  trained  and  experienced 
Naval  officers,  and  both  sides  had  had  considerable 
time  to  develop  their  plans  before  the  game  com¬ 
menced.  In  addition  to  the  game  being  basically  a 
manual  one  without  automated  states  boards,  etc., 
there  were  severe1  levels  of  conmand  represented  on 
each  side.  Thus  It  was  quite  a  different  environment 
and  perhaps  more  closely  simulated  the  conditions 
which  one  might  expect  in  a  typical  command  node  in  a 
Navy  C2  system. 

In  order  to  obtain  some  comparative  data,  the  logged 
events  were  divided  Into  the  classes  shown  in  Figure 
7.  These  classes  were  chosen  as  being  more  repre¬ 
sentative  of  the  log  entries,  although  they  still 
retain  the  distinction  between  action  and  Information 
decisions.  (Although  these  classes  are  also  divided 
into  “input"  and  “output"  groups,  no  analysis  of  this 
dimension  has  yet  been  undertaken.) 

Using  these  data,  which  are  based  on  the  editing  and 
compilation  of  many  observers'  logs  Into  one  by  the 
game  historian,  similar  analyses  were  carried  out. 

Two  typical  plots  are  shown  In  Figures  8  and  9,  which 
depict  one  six-hour  period  In  the  game  at  two  levels 
of  command,  CTF  70.1,  the  Battle  Force  Commander,  and 
70.X,  one  of  his  four  Immediate  subordinates.  It 
should  be  noted  that  the  basic  time  unit  of  "wfdth" 
used  In  these  Figures  Is  one  minute  rather  than  the 
second  used  for  the  UES  data. 

While  the  most  striking  thing  about  these  data  Is  the 
similarity  of  the  plots  to  those  obtained  with  the 
data  taken  during  the  WES  war  games,  there  are 
several  other  Interesting  points. 

First,  the  activity  seems  to  be  peaked  at  the  level 
of  tiie  Battle  Force  commander.  While  not  displayed, 
the  plots  of  activity  at  the  level  of  SEVENTH  Fleet 
Comnander  show  only  about  one-sixth  the  number  of 
decisions  per  unit  time  that  are  evident  at  the  Force 
Comnander  level.  And  the  Force  Commander's  subordi¬ 
nate  Is  operating  at  about  half  his  tempo.  However, 
there  are  four  subordinates,  so  the  total  activity  Is 
higher  at  the  lower  level,  as  one  would  expect  It  to 
be. 

Second,  the  effective  mean  time  between  decisions 
seems  to  be  about  two  minutes,  rather  than  the  15  to 
20  seconds  seen  In  the  WES  game.  This,  of  course, 
may  b;  an  artifact  of  the  coarser  time  Increments 
used  in  the  Newport  logs,  as  well  as  the  less 
tactical  nature  of  the  decisions  being  made. 

And  thirdly,  the  statistics  on  types  of  transactions 
were  exactly  reversed  from  that  seen  at  WES. 
Two-thirds  of  the  decisions  Involved  Information  and 
only  one-third  or  less  Involved  action.  Whether  this 
was  due  to  the  lack  of  computer  aids  or  to  a  more 
realistic  simulation  of  a  command  situation  Is  not 
clear.  It  Is  hoped  that  additlor.al  light  can  be  shed 
on  this  subject  by  obtaining  some  more  data  at  the 
War  College,  which  has  had  a  major  computer 


facility  Installed  to  support  its  war  gaming 
functions  since  these  data  were  taken. 

Conclusions 

First,  It  seems  evident  that  one  can  define  space- 
time  events  (which  we  shall  call  decisions  or 
transactions)  which  deal  with  either  actions  or 
Information  and  which  are  made  observable  through 
utterances  or  keystrokes  mat*  by  the  Battle  Staff,  on 
which  different  observers  will  agree.  That  Is,  dif¬ 
ferent  observers  will  report  observing  the  same  type 
of  event  at  the  same  t1"'e.  And  It  seems  likely  that 
these  space-time  events  could  be  further  subdivided 
with  more  careful  training  and  discussion  among  the 
observers. 

Second,  the  gross  Inter- transaction  time  has  a 
distribution  not  unlike  a  Rayleigh  distribution,  and 
which  seems  to  be  a  common  behavior  even  In  quite 
different  sltutatlons. 

Third,  the  Inter-transaction  time  between  trans¬ 
actions  of  the  same  type  may  differ  between  different 
types. 

Fourth,  the  time  between  transactions  seems  to  be  a 
minimum  at  the  level  of  the  highest  “on-scene" 
Comnander.  This  Is  Interpreted  as  meaning  that  his 
“operating  tempo*  Is  forced  to  be  high  enough  to 
encompass  the  total  activity  below  him,  which  Is 
divided  among  several  subordinates. 

And  finally,  the  qualitative  differences  between  the 
Newport  data  and  that  taken  on  WES  may  Indicate  that 
Introducing  computer  assistance  Into  a  conmand  center 
will  change  the  nature  of  the  “transactions"  which 
one  sees  taking  place.  If  true,  this  may  have  seri¬ 
ous  Implications  for  the  design  and  organization  of 
future  conmand  control  systems. 
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Figure  1. 

Data  Font  Used  for  WES  Experiments. 
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Figure  2. 

Print-Out  of  Computed  Time  Intervals  as  Stored  In  Computer  Data  Files. 
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Figure  4. 

Distribution  of  Time  Intervals  Seen  by  Observer  Lombardo. 
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Figure  5. 

Time  Intervals  Between  "Action-  Decisions  ( Un-normal  1  zed). 
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Decision  Taxonomy  Used  for  Newport  data. 

The  distinction  between  directives  and  tasking  Is  that  directives  leave  the 
Method  of  Implementation  to  the  subordinate,  while  a  tasking  Includes  specific 
detailed  Instructions.  The  J£  category  was  used  for  remarks  by  the  historian. 
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Distribution  of  Time  Intervals  Observed  In 
Command  Center  of  Battle  Force  Commander . 
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Figure  9. 

Distribution  of  Time  Intervals  Observed  In  the 
Command  Center  of  a  Subordinate  Battle  Group  Conmander. 
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Summary 

This  paper  describes  a  preliminary  concept  for 
exploring  strategic  command  and  control  Issues  with  the 
automated  war  gaming  of  Rand's  Strategy  Assessment 
Center  (RSAC).  The  concept  features:  a  top-down 
functionally  oriented  approach  relevant  to  the 
interests  of  civilian  and  military  leaders;  a 
hierarchical  and  otherwise  multilevel  gaming  structure; 
and  heuristic  rule-based  models  using  a  variety  of 
artificial  intelligence  techniques.  The  approach  will 
be  sensitive  to  key  features  of  war  plans  and  control 
procedures.  It  will  make  a  start  on  reflecting  such 
phenomena  as  nonunitary  decisionmaking,  deception,  and 
confusion.  It  will  take  into  account  some  of  the 
asymmetries  distinguishing  the  U.S.  and  Soviet 

approaches  to  C3I.  Initial  versions  of  the  implemented 
concept  should  be  useful  and  interesting  but  will  be 
relatively  simple;  with  time,  it  should  be  possible  to 
evolve  gracefully  and  use  some  of  the  detailed  models 

available  on  pieces  of  the  overall  C  I  problem.  Even 
the  early  work,  however,  will  represent  a  major  break 

with  past  strategic  analysis  in  which  C3I  issues  have 
been  largely  ignored  but  for  limited  treatment  of 
communications . 

I.  Introduction 

This  paper  describes  a  concept  for  using  the 
emerging  technique  of  automated  war  gaming(l)  to 
explore  problems  of  strategic  command,  control, 
communications,  intelligence,  and  warning  (abbreviated 

here  as  CJI  or  as  "command  and  control”).  The  concern 
hsre  is  largely  with  the  architecture  of  an  approach 
rather  than  with  details,  many  of  which  remain  to  be 
worked  out.  Formal  architecture  is  essential  because 
we  seek  an  approach  that  will  be  broad  and  useful  in 
its  earliest  manifestations  and  that  can  evolve 
smoothly  over  time  to  address  a  substantia'  traction  of 
the  strategic  comaand-and-control  issues  oi  principal 
concern  to  nationat  leaders.  These  include: 
continuity  of  government;  timely  command  decisions 
involving  both  Intercontinental  and  theeter-nuclear 
forces;  continuing  control  over  those  forces;  and 
prosecution  of  conflict  (which,  depending  on 
circumstances,  might  call  for  decisive  military  action, 
controlled  escalation,  or  de-escalation).  In  future 
work  we  plan  to  go  beyond  the  current  paper's  emp  issis 
on  strategic  nuclear  weapons  and  to  extend  the 
architecture  to  cover  strategic  aspects  of  global 
conflict  generally. 

The  paper's  outline  is  as  follows:  first,  we 
describe  succinctly  the  principal  features  of  the  RSAC 
automated  gaming  system;  we  then  discuss  our  view  of 

what  the  strategic  C3I  problem  really  is--or  should  be 
considered  to  be  in  our  work.  Finally,  we  describe  the 
philosophy  of  our  approach  and  sketch  out  our  Intended 
plan  for  implementing  it. 


I I ■  Background 

The  Rand  Strategy  Assessment  Center  (RSAC)  is 
developing  a  new  approach  to  strategic  analysis  that 
attempts  to  combine  the  contextual  richness  and 
operational  complexity  of  war  gaming  with  the  rigor  and 
transparency  of  analytic  modeling.  On  the  one  hand,  we 
are  building  a  large-scale  simulation  mcdel  with  the 
structure  of  a  political-military  war  game.  In  this 
simulation,  models  represent  the  various  national 
players,  making  decisions  of  both  a  political  and 
military  nature.  The  simulation  can  be  fully 
automated.  On  the  other  hand,  much  of  the  RSAC's  work 
will  be  highly  interactive  with  human  teams  playing 
against  computerized  adversaries  or  changing 
assumptions  about  such  matters  as  combat  outcomes  to 
see  the  strategic  reaction  of  the  automated  players. 

At  the  technical  level,  the  RSAC  is  extending 
several  modern  techniques  in  artificial  intelligence 
(A. I.)  as  well  as  using  more  standard  modeling  and 
analysis  techniques. [1,2]  We  shall  not  discuss  the 
techniques  here.  Instead,  let  it  suffice  to  say  that: 
(1)  we  make  extensive  use  of  heuristic  rule-based 
modeling  in  an  English-like  programming  language;  (2) 
that  our  decision  models  use  such  devices  as  pattern 
matching  and  search  (with  lookaheads  accounting  for 
likely  opponent  behavior);  and,  very  importantly,  (3) 
that  contact  with  military  realism  is  achieved  in  part 
by  relating  a  (greatly  extended)  version  of  A. I. 
scripts  to  analytic  constructs  akin  to  war  plans. (1-3) 
From  the  viewpoint  of  A. I.  research  our  effort  is 
notable  because  it  is  a  rare  application  to  realistic 
high-level  military  Issues  and  because  the 
application's  scope  has  caused  us  to  develop  concepts 
for  managing  complexity  in  rule-based  models  that 
should  have  more  general  value. [4] 

Figure  1  provides  an  overview  of  the  RSAC  system 
emphasising  its  hierarchical  structure,  something  that 

3 

will  prove  important  in  treating  C  I.  The  first  column 
shews  the  nominal  meve  «eq-._>epcc  in  the  overall  geme. 

The  automated  players  arc:  (1)  Red  Agent  representing 
the  Soviet  Union;  (2)  Bine  Agent  representing  the 
United  States;  (3)  Scenario  Agent  representing  to  first 
order  all  nonsuperpower  countries  on  a  country-by- 
country  basis;  and  (4)  Force  Agent.  The  latter  model 
is  not  rsslly  a  player--rather,  it  keeps  track  of 
forces  worldwide  and  computes  the  results  of  battle  and 
other  military  operations  such  as  movement.  Its 
individual  submodels  are  typically  simple  and 
aggregated,  but  because  of  the  simulation's  breadth  and 
the  requirements  to  interrelate  phenomena  across 
theaters,  force  types,  and  levels  of  conflict.  Fores 
Agent  is  quits  complicated  overall.  There  is  also 
another  model  called  System  Monitor,  which  guides  game 
development  by  scheduling  moves  and  managing  interfaces 
with  automated  recordkeeping,  displays,  and  humans. 

The  second  column  provides  s  closer  view  of  what 
happens  in  a  single  major-agent  (Red  or  Blue)  move. 
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Fig,  1— A  Hierarchical  View  of  RSAC  Automated  War  Gaming 

The  move  begins  with  tue  agent  assessing  his  success 
with  a  previously  chosen  plan.  If  all  is  going  well, 
he  merely  continues  on  that  plan--which  is  represented 
in  code  by  RSAC  extensions  of  A. I.  scripts. (2)  All 
plans  (or  scripts)  have  bounds,  however,  and  if  any  of 
the  bounds  have  been  broken  (e.g. ,  by  excessive 
attrition  or  delay,  or  by  the  opponent's  escalation), 
then  the  Agent  must  reconsider.*  This  process  begins 
with  a  rule  set  associated  with  functions  of  the 
national  command  level  (NCL) .  The  NCL  chooses  a 
tentative  and  incomplete  war  plan  to  be  filled  out  and 
tested  by  the  area  command  level  (ACL),  which 
corresponds  loosely  with  the  functions  of  area 
commanders  such  as  U.S.  CINCs  or  Soviet  TVD  commanders. 
The  plan  testing  includes  a  lookahead  implemented 
through  the  tactical  control  level  (TCL),  which 
controls  the  interfaces  with  Force  Agent  and  (together 
with  Force  Agent's  submodels)  determines  many  of  the 
detailed  decisions  about  orders  of  battle,  allocation 
of  resources,  etc.  (decisions  that  should  net  be 
highlighted  in  a  strategic  level  game).  The  lookahead 
is  a  game  within  a  game  using  the  agent's  assumptions 
about  other  players'  actions  and  the  likely  results  of 
combat.  If  the  plan  passes  the  test,  it  is  then 
implemented,  again  through  the  TCL  level.  Otherwise, 
the  ACL  may  adjust  the  plan  and  try  again,  or  report 
back  to  the  NCL  that  some  strategic-level  decisions 
must  be  changed.** 

Continuing  to  unpeol  the  onion,  the  third  column 
provides  more  detail  on  what  happens  in  the  NCL.  The 
notable  feature  here  is  the  process  model  guiding  the 
structure  of  rule  sets.  In  practice,  we  must  rigidly 
define  the  permitted  forms  of  escalation  guidance, 
objectives,  and  strategies,  and  then  write  unambiguous 
rules  leading  from  game  observables  (e.g.,  combatants, 
location  of  conflict,  and  status  of  forces)  to  unique 
permitted  forma.  We  shall  not  discuss  such  matters 
here  even  though  they  are  consuming  a  major  amount  of 
effort  and  time. 

The  fourth  column  expands  upon  the  Choose  Strategy 
process  by  noting  that  strategies  must  be  chosen  for 
each  of  the  several  military  theaters,  including  the 
intercontinental  and  space  theaters,  and  then 


*  The  techniques  for  building  such  plans  and 
scripts  are  under  development  by  William  Jones,  Norman 
Shapiro,  and  Richard  Wise. 

**  The  sophisticated  reader  will  recognize  that 
Fig.  1  is  an  idealization  with  imperfect  fidelity  to  the 
actual  computer  programs.  In  the  code,  distinctions 
among  NCL,  ACL,  TCL,  and  decision  rules  in  Force  Agent 
are  sometimes  fuzzy  and  the  functions  alluded  to  In  the 
boxes  of  Fig.  1  are  sometimes  accomplished  by  rules 
distributed  throughout  the  program.' 


coordinated.  Finally,  the  last  column  expands  upon 
this  by  suggesting  some  of  the  many  steps  required  to 
define  theater-strategy  component*  that  would  be 
decided  (or  at  least  reviewed)  at  the  KCL.  These  would 
include:  (1)  consistency  of  actions  with  overall 
escalation  guidance  and  objectives;  (2)  cross-theater 
coordination;  and  (3)  resource  allocation  across 
theaters. 

Given  this  quick  overview  of  RSAC  system 
architecture,  let  us  note  socm  particular  items 
relevmt  to  what  follows: 

o  Variable  behavior  pattern*.  The  behavior 

patterns  of  Red,  Blue,  and  Scenario  Agents  are 
variable  to  reflect  fundamental  uncertainties 
about  the  true  patterns  to  be  expected. 

Hence,  we  speak  of  alternative  "Ivans," 

"Sams,"  and  third-country  "temperaments." 

o  Parametric  force  models.  Similarly,  Force 

Agent 1  a  component  models  are  highly  parametric 
with  the  parameters  chosen  for  strategic- 
level  analysis  (e.g.,  a  few  simple  equations 
that  calculate  bomber  prelaunch  survivability 
rather  than  a  complex  model  considering 
details  such  as  the  propulsion  characteristics 
of  a  Soviet  SLBM  that  might  be  used  to  attack 
bomber  bases). 

o  Use  of  scripts.  The  decision  models  do  not 
generally  extend  below  important  operational- 
level  issues.  Instead,  the  Agents  choose 
among  discrete  war  plans  in  the  form  of 
scripts,  each  of  which  contains  a  number  of 
microscopic  (and  sometimes  arbitrar*.’)  action 
instructions  (which  may  contain  slot  t  for 
parameters  to  be  filled  in  by  some  AcL  or 
Force-Agent  subroutine  at  the  appropriate 
time) . 

o  Unmodel able  Phenomena .  RSAC  games  allow 
certain  phenomena  to  occur  by  fiat  if  the 
analyst  so  chooees--e.g. ,  in  some  fraction  of 
game  runs  the  analyst  may  want  to  have  riots 
occur  in  Poland  if  certain  other  events  occur. 
Although  the  origin  and  nature  of  those  riote 
are  not  simulated.  Red’s  rules  may  be 
sensitive  to  whether  '.ha  riots  occur.  The 
riot  flag  is,  of  course,  a  surrogate  for  whole 
classes  of  important  real-world  events 
contributing  to  fog-of-war  effects  (and 
escalation).*  Such  devices  are  familiar  in 
manual  gaming  but  may  seem  unnatural  to 
traditional  modelers. 

Finally,  let  ua  summarize  the  RSAC  system's 
essential  elements,  distinguishing  between  variables, 
hard-wired  items,  etc.  This  is  actually  not  so  simple 
because  the  RSAC  system  is  designed  for  flexibility 
with  a  variety  of  users  who  have  different  notions 
about  what  should  be  hard-wired.  However,  Table  1 
shows  an  illustrative  breakdown  appropriate  for  an 
application  comparing  nuclear-employment  concepts  in  a 
range  of  scenarios.  Although  only  illustrative,  it 
demonstrates  that  there  are  many  pieces  to  the  overall 
system--all  of  which  must  be  considered  when  attempting 
to  treat  command  and  control. 


3 

III.  Defining  the  Strategic  C  I  Problem 

3 

Before  deciding  on  an  approach  to  the  C  I  problem 
we  must  first  know  what  it  is.  One  way  to  define 

3  3 

strategic  Cl  is  to  list  the  ingredients  of  s  C  I 
system:  (1)  decisionmaking  bodies  and  their  procedures; 
(b)  command  centers  to  integrate  information  for  the 
decisionmakers;  (3)  control  procedures  to  assure  that 

*See  Refs.  S  and  6  for  the  P.SAC's  conceptual 
approach  to  escalation  modeling. 
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Table  1 


ELEMENTS  OF  THE  RSAC  SYSTEM  AS  VIEWED  WHEN  ASSESSING 
ALTERNATIVE  STRATEGIES 


*i*ed 

Structure 

FU-d 

Charscterist ics(a) 

Variable  Principal 

Characteristics  Variables 

o  G«m  structure 

o  War  plena/ 

o  Sams,  Ivans,  o  Employment 

built  tround 

Scripts 

national  strategies 

two  (not  N) 

temperaments 

primary  pleyere 

o  Rules  for 

o  Force  models 

o  Initiating 

determining 

Scenario 

move  sequence 

o  Individual 
Scenario- 
Agent  rules 

0  Sequence  of 

o  Some  key 

some  NCL 

parameters 

decisions 

and  rules  in 

Fores, 

o  Treatment  of 

Scsnsrio,  and 

co— nd  levels 

Scripts 

|a|  In  other  applications  thasa  would  ba  considered  variable. 


decisions  will  be  implemented  if  transmitted;  (4) 
intelligence  systems  to  provide  strategic  and  tactical 
warning  of  attacks  and  other  information  on  enemy 
forces;  and  (S)  communications  to  provide  information 
on  one's  own  forces,  to  permit  transmission  of 
decisions,  and  to  permit  report -back  on  the  results  of 
execution 

It  is  important  to  remind  ourselves  that  C^I  does 
not  depend  on  physical  communication  systems  alone-- 
it  involves  analysis  by  the  NCA  and  his  staff,  as  well 
as  lower  command  levels,  decisionmaking  procedures . 
doctrine  (which  is  a  key  element  in  control  md 
planning),  and  many  other  items.  Indeed,  the  breadth 
and  complexity  of  command  and  control  are  such  that 
definitions  and  flow  diagrams  are  often  too  abstract  to 
communicate  what  the  down-to-earth  problems  people 
worry  about  (or  should  worry  about)  actually  are.  It 
is  therefore  useful  to  list  some  of  those  illustrative 
concerns,  drawing  on  the  public  literature  as  well  as 
our  own  knowledge.*  In  listing  these  questions,  we 
have  not  attempted  to  order  them  by  their  actual  or 
perceived  importance.  Further,  although  these 
questions  have  caught  the  common  fancy  as  "important" 

3 

examples  of  potential  C  I  failures,  we  cannot  exclude 
the  possibility  that  prospective  problems  are  more 
imagined  than  real  in  some  cases. 

Illustrative  Questlors 

1.  What  if  the  Soviets  attack  Washington  on 
Inauguration  Day?  Would  we  be  "decapitated?" 
What  does  "decapitation"  mean  on  an 
operational  level? 

2.  What  if  the  Secretary  of  Education  becomes  the 
National  Command  Authority  (NCA)7  Will  he 
know  enough  to  make  timely  strategic 
decisions? 

3.  What  *re  the  implications  of  delegation, 
nredelegatlon  for  contingencies,  and 
unilateral  lower-level  assumption  of 
authority? 

4.  What  are  the  implications  of  normal 
decentralization  of  authority  for  control  of 
events  relevant  to  escalation?  In  other 


*  The  unclassified  literature  on  strategic  C  I 
varies  widely  in  accuracy  and  quality.  For  an 
apocalyptic  and  influential  essay,  see  Ref.  7;  for  an 
interesting  (but  not  always  accurate)  survey,  see  Ref. 
8;  for  guarded  discussion  by  knowledgeable  experts,  see 
Ref.  9.  See  also  Refs.  10  and  14. 


words,  will  the  separate  commands  (CINCs  and 
Soviet  TVD  commanders)  be  taking  what  they 
regard  as  standard  measures  that  might  on  the 
one  hand  raise  the  likelihood  of  escalation  or 
on  the  other  hand  fail  adequately  to 
anticipate  the  requirements  of  nuclear 
conflict  or  make  nuclear  strikes  more 
lucrative?  (Examples;  ASW  operations, 
threatening  SSBNs  or  dispersal  of  nuclear 
weapons  in  conventional  conflict,  or--on  the 
other  end- -massing  of  forces  to  achieve 
improved  force  ratios  in  conventional 
operations.) 

5.  What  operational  constraints  narrow  the  NCA'a 
employment  options  (e.g.,  concerns  for  SSBN 
vulnerability,  weapon  range,  retargeting 
Inflexibility,  option  purity,  limitations  in 
assessment  capability)?  What  doctrinal 
constraints  similarly  narrow  his  options 
(e.g.,  failure  to  train  crews  for  massive 
retargeting)? 

6.  What  Soviet  actions  should  we  anticipate  early 
in  conflict  by  virtue  of  Soviet  doctrine's 
emphasis  on  preparing  for  the  nuclear  phase? 
How  should  we  prepare  to  observe,  understand, 
protect  against,  and  react  to  such  measures? 

7.  What  if  we  lose  some  or  all  communications 
(one-way  or  two-way)  to  the  ICBMs,  SSBNs, 
bombers,  and/or  CINCs...?  Could  we  lose  the 
capability  for  assured  retaliation,  limited 
responses,...?  Is  the  EMP  threat  real  and 
potentially  devastating?  What  if  we  lose 
communications  to  the  Soviets?  Will  that 
preclude  termination  short  of  unrestrained 
general  nuclear  war? 

8.  What  if  we  lose  early  warning  satellites 
from  antisatellite  attacks,  sabotage  on 
the  ground,  system  failures,  or  unknown 
reasons? 

9.  Assuming  the  potential  NCA  desire  to  make 
limited  responses  to  nuclear  attacks,  what 
capabilities  at  what  level  will  be  necessary 
to  make  appropriate  limited  responses 
possible--not  only  at  the  outset  of  conflict, 
but  as  a  function  of  time  thereafter? 

10.  What  if  the  results  of  initial  conflict  are 
sharply  different  from  those  anticipated-- 
because  of  system  failures,  surprise  tactics, 
or  whatever?  What  capabilities  are  needed  to 
permit  at  least  modest  replanning? 

11.  What  if  communications  are  adequate  initially 
but  rapidly  degrading?  What  are  the  effects 
on  crisis  stability  and  future  ability  to 
prosecute  the  conflict? 

12.  To  what  extent  is  a  counterforce  war  made 
infeasible  by  fragility  in  the  command  and 
control  system?  What  are  the  implications  of 
one  or  both  sides  having  command  and  control 
inadequate  to  prosecute  an  extended  conflict?* 

13.  To  what  extent  does  the  nature  of  actual  war 
plans  circumscribe  the  feasibility  of  limiting 
the  scope  of  nuclear  war,  once  it  begins?  Are 
they  so  inflexible  as  to  preclude  controlled 
responses  or  are  they  in  fact  adequately 
flexible  given  the  limited  number  of  plausible 
options,  the  relative  predictability  of 
certain  aspects  of  strategic  nuclear  war,  and 
the  difficulty  or  cost  of  achieving  enduring 
command  and  control  to  support  more  fine- 
tuned  responses? 


*  Interestingly  enough,  it  is  sometimes  argued  that 

enduring  cl  capability  would  be  destabilizing,  a  view 
that  would  shock  most  defense  professionals.  See,  for 
example.  Ref.  10. 
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The  list  reinforces  the  conclusion  that 

dressing  strategic  C^I"  scans  addressing  an 
irsous  range  of  Issues  involving  everything  from 
indard  operating  procedures  at  the  operational  level 
the  laplications  of  satellite  vulnerability.  Upon  a 
tent's  reflection  it  it  also  evident  that  the  answers 
the  questions  posed  depend  upon  such  diverse 
riables  as:  (1)  nature  of  the  superpower  leaderships 
conflict;  (2)  succession  and  devolution 
rangemects;  (3)  locations  and  levels  of  conflict;  (41 
(or  history  (initiating  scenarios);  (S)  status  of 
.ces  by  type  and  theater;  (6)  range,  quality,  and 
txibility  of  preplanned  options;  (7)  the  enemy's 
trail  strategy;  (8)  technical  issues  such  as  the 
rvivability  of  many  systems  (or  functions);  and  (9) 

3 

constitution  capability  for  each  component  of  C  IW. 
shall  begin  to  address  then  in  Sec.  IV. 

IV,  Conceptual  Architecture  for  an  Approach 
sic  Considerations  Governing  Approach 

Section  III  demonstrates  that  handling  C^I  within 
a  structure  of  an  ordinary  Modeling  approach  is 
■ply  not  feasible --too  many  of  the  issues  are  less 
antitative  and  naturally  analytic  than  operational  or 
havioral,  and  it  would  be  fruitless  to  try  reducing 
r  problem  at  hand  to  a  very  smell  number  of  simple 
irlables.  By  contrast,  the  emerging  capabilities  of 
itomated  war  gaming  will  be  an  ideal  vehicle  for 
ploring  many  of  the  issues  systematically.  Indeed, 
icognltion  that  gaming  was  probably  essential  in 
'eating  effects  such  as  command  and  control  underlay 
ich  of  the  initial  government  interest  in  the  RSAC 
roject. (11)  The  RSAC  project  is  now  far  enough  along 

>  that  making  this  idea  a  reality  is  a  high-priority 
:em. 

The  most  important  premise  governing  our  approach 

>  strategic  command  and  control  is  that  such  issues 
Muld  be  reflected  in  the  very  fabric  of  the  RSAC 
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yatem--to  view  C  I  as  merely  one  more  effect  for  which 
program  "module"  needs  to  bo  developed  would  be  to 
isunderstand  utterly  the  nature  of  the  problem, 
ndeed,  it  would  be  clot or  to  the  mark  were  we  to  say 
hat  the  various  and  sundry  RSAC  models  should  be 
nbedded  within  the  fabric  of  a  command -and -control 
onstruct  than  vice  versa. 

All  this  Implies  that  we  need  a  conceptual 
rchitecture  for  our  approach  rather  than  a  mere  grab 
ag  of  physical  models  and  artificial  intelligence 
echniques.  Ve  shall  now  sketch  out  what  we  see  as 
esign  requirements  for  our  effort  and  then  provide  the 
utline  of  our  intended  approach. 

Pea ign  Requirements .  Upon  reviewing  the  state  of 
he  art  in  strategic  .analysis,  the  major  strategic 
.ssues  of  the  day,  as  we  see  them,  the  nature  of  the 
ISAC  charter  from  DoD,(l]  and  the  conclusions  of  some 
■ast  DoD  work, (12]  we  have  developed  the  following 
irinciples  as  guidance: 

1.  As  mentioned  above,  the  command-and-control 
issue  should  be  reflected  throughout  the 
fabric  of  the  simulation  and  not  merely  in 
some  "module." 

2.  By  virtue  of  our  strategy-level  focus,  the 
approach  should  be  top-down  rather  than  bottom 

3 

up.  This  implies  we  should  be  focusing  on  C  I 
functions  rather  than  individual  systems; 
moreover,  it  means  that  the  key  game  variables 
and  displays  should  be  aggregated  and  in  a 
form  natural  to  strategic- level  discussion. 

3.  The  character  of  the  system  must  account  for 
the  existence  of  multiple  levels,  and 
locations  within  levels,  of  command  and 


control  authority.  Moreov.r,  it  should 
reflect  hierarchical  phenomena.* 

4.  Even  our  early  efforts  to  reflect  command  and 
control  should  be  useful  and  realistic.  It  is 
better  to  reflect  som>>  of  the  real  command  and 
control  issues  early  than  to  treat  command  and 
control  comprehensively  for  a  "toy  problem”  of 
no  direct  value. 

5.  However,  the  approach  should  be  evolutionary 
and  should  allow  linkup  to  some  of  the 
detailed  work  being  conducted  within  the 
defense  community  on  such  matters  as 
communications  connectivity. 

6.  Although  an  evolutionary  approach  is 
appropriate,  it  should  be  broad-based  from  the 
outset-*touching  insightfully  upon  both  U . S', 
and  Soviet  command,  control,  communications, 
intelligence,  and  warning  rather  than  dwelling 
exclusively  on,  for  example,  U  S.' capability 
to  communicate  an  Emergency  Action  Message. 

7.  The  approach  must  permit  the  analyst  (or  game 
director  when  human  teams  are  involved)  to 
introduce  phenomena  representative  of  the  fog 
of  war --phenomena  such  as  may  arise  from 
causes  as  diverse  as  unconventional  warfare, 
flukes  of  nature,  or  catastrophic  weapon- 
system  failure.** 

Elements  of  Architecture.  To  move  from 
requirements  to  an  architecture  we  must  first  think 
about  what  would  constitute  an  architecture.  How  would 
we  know  if  N«  had  one?  Remember  here  that  we  are  not 
dealing  merely  with  the  design  of  a  communicat ions 
.system.  Rather,  we  /.re  dealing  with  an  approach  to  the 
design  and  application  of  an  interactive  war-game- 
based  simulation.  Upon  reflection,  and  upon  looking 
back  at  Table  1 ,  which  itemized  the  elements  of  the 
RSAC  automated  gaming  system,  it  seems  we  must  provide 
four  different  items: 

o  A  suitable  structure  for  the  simulation  (one 
that  will  provide  the  appropriate  perspective 
on  a  multifaceted  problem). 

o  Variables  (and  corresponding  data  structures) 
suitable  for  reflecting  command-and-control 
factors  simply  and  transparently  in  rules  and 
algorithms. 

*  The  hierarchical  principle  of  complex  systems  is 
what  underlies  the  frequently  mentioned  analogy  between 
living  systems  and  command  and  control.  Each  level  of  a 
hierarchy  has  a  recognizable  separate  existence  and  a 
, set  of  internal  processes.  It  communicates  up  and  down 
the  hierarchy,  but  the  communications--however 
important--represent  only  a  small  fraction  of  the 
activities  and  are  of  little  concern  to  most  components 
of  that  level.  Moreover,  communications  can  be  delayed, 
imperfect,  Inappropriate,  misunderstood,  etc. 
Manifestations  of  hierarchical  effects  are  familiar  to 
students  of  organizational  theory,  large  business  and 
government  operations,  biologists,  and  certain 
philosophers  such  as  Arthur  Koestler  who  see 
hierarchical  principles  as  having  broad  applicability. 

We  should  note  tfcrt  it  is  one  thing  to  build  nested 
multilevel  models,  which  are  by  no  means  unusual,  and 
quite  another  to  reflect  hierarchical  principles 
adequately. 

**  We  wish  to  achieve  here  some  important  features 
of  manual  gaming  in  which  the  Control  Team  can  force 
teams  to  focus  on  events  that  are  systematically  left 
out  of  typical  model-generated  scenarios.  For  example, 
a  Control  Team  can  decree  to  the  Blue  Team  that  its 
early-warning  capability  h„s  vanished,  and  thereby  force 
the  Blue  Team  to  think  out  what  it  would  do  in  such  a 
situation.  The  Control  Team  does  not  have  to  explain  in 
detail  how  the  catastrophe  occurred  (although  good  game 
practice  would  entail  a  plausible  explanation).  See 
Ref.  3. 
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Wa  shall  now  discuss  our  Intended  approach  and 
touch  upon  each  of  these  items  in  turn. 

Outline  of  a  Conceptual  Approach 


Algorithms  and  rules  for  manipulating 
variables  and  data  structures  to  model  desired 
effects. 

3 

A  management  roadmap  for  assuring  that  C  I 
issues  are  introduced  consistently  throughout 
the  complex  RSAC  system. 


Simulation  Structure .  We  shall  not  discuss  the 
issue  of  simulation  structure  in  much  detail  here 
because  it  should  be  reasonably  cleat  from  Fig.  1  and 
the  attendant  discussion  that  we  anticipated  most  of 
the  structural  issues  from  early-on  in  the  -v-yraa.  In 
particular,  the  RSAC's  basic  system  design  is  weii- 
sulted  to  treatment  of  hierarchical  and  otherwise 
multilevel  effects;  moreover,  it  is  flexible,  modular, 
and  designed  with  the  expectation  of  evolutionary 
development  from  simple  rule-based  models  to  more 
sophisticated  models  using  results  of  detailed  work  on 
such  problems.  Although  it  will  be  some  time  before  we 
make  extensive  use  of  the  capability,  the  system  design 
also  permits  us  to  maintain  separate  data  bases  for 
Red,  Blue,  and  "Real  World."  That  is,  our  data 
structures  permit  us  to  have  Red,  Blue,  and  Force  Agent 
to  see  different  data  bases  on,  for  example,  the  status 
of  Blue's  forces.  Handling  such  <ffects  is  painful  for 
the  analyst  but  will  be  essential  for  looking  into 
matters  such  as  deception  and  the  fog  of  war. 


Variables  and  Data  Structures.  Discussions  of 
rule-based  models  sometimes  seem  to  suggest  that  rule 
writing  is  easy  and  that  structure  is  unnecessary- 
all  that  is  necessary  is  to  find  an  "expert."  In  fact, 
however,  there  are  many  instances  in  which  experts  able 
to  provide  a  complete  and  incisive  sot  of  rules  simply 
do  not  exist.  In  that  case,  which  generally  applies  to 
comm and- ami-control  issues,  it  is  necessary  for 
analysts  to  do  a  great  deal  of  background  work  to  help 
experts  think  clearly  and  cover  all  the  bases. (4]  This 
implies  anticipating  (to  the  extent  possibin)  what  the 
experts  will  eventually  find  to  be  the  most  natural  way 
to  express  rules  simply  and  understandably  from  a 
strategic-level  perspective.  That  is,  we  must 
anticipate  the  appropriate  variables,  related  data 
structures,  and  logic  flows. 

Analyt  eally,  a  major  problem  he.-e  is  that  the 
"natural  variables”  for  those  building  the  individual 
pieces  of  the  RSAC  simulation  are  often  not  the  natural 
variables  in  which  to  express  particular  rules.  For 
example,  the  RSAC  has  a  world  data  base  with 
information  on  worldwide  forces,  national  orientations, 
etc.  The  data  are  collected  initially  in  forms  driven 
by  the  models  that  track  locations  and  status  of 
individual  forces  and  the  like.  However,  this  form  is 
too  disaggregated  and  disorganized  relative  to  what  we 
need  either  to  write  simple  decision  rules  involving 
command  and  control,  cr  to  write  simple  rules  or  models 
describing  command-and-control  effects  in  the  execution 
of  options. 

With  these  considerations  in  mind,  we  are 
cvcr-ntly  working  out  on  a  classified  basis  the  details 
of  an  approach  outlined  in  Table  2.  Some  of  the  basic 
notions  here  are  as  follows; 


o  It  is  useful  to  construct  three  time-dependent 
state  vectors  in  addition  to  0,  which 
characterizes  the  "world  data  base”  as  it  is 
evolving  in  RSAC  work.  The  three  vectors,  C 

(with  subvectors  for  the  C^I  and  W  components 

3 

of  C  I),  and  S,  and  N,  pertain  to  the  states 

of  C^I ,  functional  support  for  the  NCA,  and 
NCA  capabilities,  respectively. 


Table  2 

ORGANIZING  DATA  FOR  SIMPLIFIED  RULE-WRITING 
SENSITIVE  TO  COMMAND  AND  CONTROL 


State  of 
Saaic 

ASM;  Syataa 
Obaarvablea 


O  -  C  -  S  -  N 

State  of 

State  of  NCA  Support  State  of 

Caeeand  and  -  (Functional  -  NCA 

Control  Capabllitiea)  Capability 


Oft)  •  (unitructured  data  on;  atatua  of  foreea  and  n«» ions'  ear 
plana  (acripte)  bains  iapleaented,  attrition  rataa,  ratea 
or  aoveeent , . . . ) 

CCt)  -  (C,,  Cr  Cj,  I,  W) 

C,(t)  •  (Nature  of  NCA,  btent  of  Delegation;  Extent 

of  Continteney  Freda  legation;  Nature  of  NCA 
Staff;  Degree  of  Information  Saturation) 

Cj(t)  •  (Lower-Level  Capability  to  Respond  (to 

higher-level  orders),  Lower-Level  Willlngneee 
to  Respond  (to  higher- level  conaands)) 

Cj(t)  •  (Communications  (by  geographic  region)  to:  1CSNS, 

SSSNs ;  goobers;  SLCH  launch  platforoa;  Satellites; 
ASAT  systems;  Other  Strategic  Defensive  Forces 
(S.Uto,  ABKs,  interceptors) ;  Nonetrategic  CINCs) 

Itt)  ■  (Intelligence  on;  Nature  of  enemy  NCA; 

Nature  of  enemy  NCA  support;  Enemy  ICBNs,  SSSNs, 

SLCH  launch  platforms,  bombers,  satellites,  ASAT 
system,  and  Other  Strategic  Defensive  Forces  (SANs, 
ABMs,  interceptors ;  and  enemy  fortes  in  theaters) 
W(t)  -  (Warning  of  attack  by:  Ballistic 
Missiles;  Air  Breathers) 

S(t)  •  (Ability  of  the  NCA  to  Obtnin  Finished  Assessments  of: 
status  of  hla  forces,  force  operations,  and  alliances; 
status  of  the  enemy  NCA,  NCA  support,  forces,  end  alli¬ 
ances.  Ability  of  rbe  NCA  support  ate//  to  Use, 

Develop,  end  Evaluate  Options,  both  before  and  after 
execution.  Ability  of  the  NCA’n  forces  to  Execute 
Options.) 

N(t)  »  (NCA  ability  to:  Assess  Option  Feaslbllty;  Modify 

or  Originate  Options;  Compere  and  Choose  Among  Options; 
and  Communicate  the  Chosen  Option.) 

Transformations  between  these  vectors  should 
be  thought  of  as  transforming  raw  data  on 
system  observables  into  forms  more  convenient 
for  rule  writing. 

o  Each  of  the  state-vector  components  and  their 
time  trendr  should  be  definable,  for  our 
purposes,  in  highly  qualitative  terms.  For 
example,  we  may  characterize  the  quality  of 
the  first  component  of  Cj,  communications  to 

ICBMs  in  a  given  geographic  region,  as:  poor, 
moderate,  or  good. 

o  We  would  expect  to  write  nearly  all  rules 

involving  the  command-and-control  influence  on 
option  selection  in  terms  of  the  vector  N  (and 
Cj,  which  detwi mines  the  major  agent's 

chareuter);  other  rules,  however,  (e.g. ,  Force 
Agent  rules  on  option  execution)  may  depend  on 
S,  C,  or--in  rare  instances--informat.ion  found 
only  in  the  world  data  base, 
o  In  defining  the  state  vectors,  their 
components,  and  the  values  of  their 
components,  we  must  be  cautious  to  maintain 
resolution  distinguishing  among  the  following, 
'••’en  in  early  work: 


-•  Theaters:  intercontinental, 

space,  others  (Europe,  SWA,...), 
and  simultaneous  multitheater 
operations 

--  Strategic  Forces:  ICBMs,  SLBMs, 
bombers,  SLCMs,  Space  Forces, 
and  ASAT  Forces 

—  Time:  crisis;  extreme  crisis  and 
possible  theater  war;  period  of 
U.S.  first  strike;  period  of 


Soviet  first  strike;  period  of 
immediate  U.S.  response,  if  any; 
initial  aftermath;  and  extended 
aftermath  (see  Fig.  2,  but  note 
also  the  possibility  of  more 
complex  stop-and-start  wars) 

-•  Opt  ion  Class :  e.g.,  limited 
versus  massive  counter  force 
options  with  modest  or  major 
coordi-nat ion  problems 
(including  theater  missions  for 
strategic  forces)  limited  and 
massive  countervalue  options, 
and  mixed  options,  in  each  case 
executed  as  a  first-strike. 
Launch  Under  Attack,  Pronpt 
secord-str ike ,  Delayed  second- 
strike,  or  Follow-on  strike. 

-•  Employment  Concept •.  distinctions 
among  options  calling  for  the 
same  results  to  be  achieved  with 
different  missions  for  the 
individual  force  types  (e.g., 
striking  the  same  targets  with 
bombers  as  opposed  to  ICBMs). 

--  Cl a; s  of  Effect :  effects  on 
ability  to  choose,  quality  of 
choice,  and  speed  of  choice. 

Strategic  Warning  fer  U.S. 


Tactical  Warning  fof  U  S. 


Attacki  on  SpKi  Syxtamt 


Launch  urdaf  attack  Replanning.  Follow-on  Strikaa.  ate. 


I<ow -Medium : 


High-Medium: 


High: 


retargeting.  No  capability  to 
change  theater  war  plans. 
Preplanned  options  of  classes  X 
and  Y  with  retar-get ing  of 
force  elements  A  and  B  feasible 
(where  A  and  B  are  the  force 
elements  for  which  retargeting 
is  most  plausible).  No 
capability  to  change  theater 
war  plans  from  the  NOA  leval. 

As  abo'»e,  except  full  retar¬ 
geting  across  force  types 
within  preplanned  options. 

Full  nominal  capabilities  as  of 
some  future  date,  including  in 
particular  the  ability  to 
retarget  against  newly  acquired 
targets  in  ad  hoc  options. 


These  definitions  allow  a  few  simple  "values”  to 
cover  a  number  of  issues.  We  would  also  need 
separately  to  characterize  the  components  in  terms  of 
timeliness  of  decision  (e.g.,  normal  or  slow). 

Obviously,  he  approach  involves  an  article  of 
faith  to  the  effect  that  the  strategic  comraand-and- 
control  pi  obi  era  can  be  reduced  to  describing 
capabilities  and  phenomena  in  a  large  but  highly  finite 
number  of  crudely  defined  discrete  states,  preferably 
states  that  can  be  sunroa  ized  briefly  in  intuitive 
terms.  Considering  that  most  strategic  nuclear 
analysis  implicitly  assumes  the  state  of  perfect 
command  and  control  (except  for  zero  strategic 
warning),  we  need  hardly  apologize  for  an  approach  that 
will  distinguish  among  tens  (or  perhaps  hundreds)  of 
states.  How  much  disaggregation  will  be  necessary 
remains  to  be  seen. 


A  — 

Crina  Extrema  Soviet 

crisis/  first 

th#*:-*  war  strife* 

(launch) 

Fig,  2- — Illustrative  Time  Periods  and  Events 

o  The  NCA's  ability  to  perform  the  functions 
listed  in  N(t)  will  depend  strongly  on  the 
types  of  options  und^r  consideration. 

Although  options  can  theoretically  be  indexed 
by  all  components  of  the  p-eceding  bullet,  we 
thus  far  believe  that  Time  and  Option  Class 
are  the  most  important  characteristics  for 
option  indexing.  Thus,  the  NCA's  ability  to 
assess  option  feasibility,  for  example,  should 
be  understood  to  be  with  respect  to  each 
Option  Class  component  within  each  Time 
component.  For  each  such  combination  we  are 
now  in  the  process  of  defining  High,  Medium, 
and  Low  labels  for  the  components  of  N(t). 

Even  further  distinctions  will  clearly  be  necessary 
when  we  begin  to  delve  more  into  theater- level  Issues, 
but  the  above  list  is  already  intimidating. 

Although  the  many  distinctions  iLemized  above  may 
seem  to  imply  that  aggregation  can't  work,  in  fact 
there  appear  to  be  many  possible  simplifications.  For 
example,  in  characterizing  the  capability  of  the  NCA  to 
develop  new  options  during  the  "Period  of  Immedi ate 
Response,"  we  might  use  Low,  Low -Medium,  High -Medium, 
and  High  as  the  principal  descriptors  with  these 
def init ions : 

Low:  No  capability  except  for  cxecu- 

t on  of  preplanned  options 
in  class  X  (X  to  be  defined  in 
terms  consistent  with  the  war 
plans/scripts  available).  No 


Roadmap  to  Integrat icn .  Assuming  that  the 
structure  we  have  outlined  provides  an  appropriate  view 
of  the  problem,  and  that  its  states  are  defined  for 
rule  writing  and  model  building  by  using  the  natural 
variables  of  the  command-and-control  problem,  the  next 
challenge  is  to  manage  the  implementation. 
Unfortunately,  this  is  inherently  difficult  because,  as 
3 

repeatedly  stated,  C  I  permeates  everything  and  must 
therefore  affect  the  work  of  numerous  people  working  on 
different  parts  of  the  RSAC  project.  There  are  at 
least  three  aspects  to  managing  the  work  in  such  a 
case:  (1)  rule  writers  and  model  builders  must  have 

checklists  of  items  to  consider,  thereby  reducing  the 
likelihood  that  Red  will  write  rules  sensitive  to  some 
command  and  control  issue  that  Blue  will  ignore,  except 
in  those  cases  where  underlying  strategic  asymmetries 
dictate  valid  differential  sensitivities;  (2)  there 
must  be  a  mechanism  of  integration  in  which  the  various 
contributors  systematically  read  each  other's  material, 
compare  notes,  and  look  for  incompatibility;  and  (3) 
there  must  be  formal*  "walkthroughs"  of  the  overall 
simulation  on  every  command  and  control  issue  expected 
to  be  important. 

k'*1  cannot  discuss  these  matters  in  much  more 
detail  he. o ,  but  we  can  point  out  a  few  items  of 
interest.  For  example,  upon  reflection  we  find  it 
useful  to  distinguish  clearly  between  command-and- 
control  effects  on  decisions,  and  command-and-control 
effects  -n  force  operations.  Figure  3  makes  this 
distinction  and  pomts  out  that  all  of  the  RSAC  agents 
are  affected.  Note  that: 

o  The  state  of  strategic  command  and  control 

(and  the  projected  state!)  must  affect  Red  and 
Blue  decision  rules  by:  (a)  affecting  Red  or 
Blue  character,  war fight ing  ability,  and 
efficiency;  (b)  limiting  Red  or  Blue  options; 
and  (c)  shading  the  perceived  attractiveness 
of  alternative  available  options.  Similarly, 
Scenario  Agent's  decision  rules  must  be 


—  ■  TIME 

Intermediata  Initial  Extended 

U  S.  reaction  aftermath  aftermath 
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Optional  w*r  plant  Na^on-by-nation  rules 
affected  by  and  aanaitiva  to  superpower's 

affecting  C’l  character  (aapacially 


Rulat  on  option  aalaction  Algorithms  and  rules 
baaad  larpaty  on  atata  daacriblnp  result*  of 

vector  In  and  eharactar  option  aaacution  at  a 

of  NCA  function  of  command 

and  control  variablaa 


Fig.  3— Command  and  Control  Effects  on 
Blue  Move  and  Results  Thereof 

sensitive  to  the  nature  of  the  superpowers ' 
national  command  authorities  and  the  overall 
effectiveness  of  those  superpowers, 
o  The  Force  Agent  must  reflect  eommand-and- 
control  effects  on  force  operations  (and  of 
projections  thereof)  by  m-ans  of:  (a)  delays 
and  related  mismatches  between  decisions  and 
current  world  states;  (b)  errors  such  as  those 
caused  by  poor  intelligence  or  communications; 

(c)  degraded  capabilities  such  as  loss  of 
certain  types  of  warning  or  intelligence;  and 

(d)  coordination  problems. 
a  The  analytic  war  plans/scripts  must  reflect  to 

some  degree  the  partially  independent 
operations  of  individual  theater  commanders 
and  the  potentially  parochial  decision  rules 
governing  those  operations.*  They  should 
reflect  doctrinal  behavior  at  the  operational 
level  except  where  there  are  good  reasons  to 
assume  otherwise.** 

o  The  analyst  using  the  RSAC  system  oust  be  able 
to  insert  "fog-of-war"  effects  and  other 
related  phenomena  easily,  something  that  has 
Implications  for  System  Monitor  in  particular, 
but  also  for  the  other  Agents  (i.e.,  them 
must  be  variables  created  to  serve  as 
surrogates  for  the  effects  in  question;  the 
variables  must  be  represented  in  all  of  the 
separate  agents). 

All  of  this  is  rather  abstract,  so  it  is  useful  to 
provide  at  least  a  partial  image  of  what  is  involved  in 
implementing  the  concept.  Thus,  let  us  discuss  what 
might  be  involved  in  reflecting  Just  one  particular 
issue,  Red's  assessment  of  Blue's  LUA  capability.  Such 
an  assessment  might  be  important  in  Red's  detailed 
attack  planning  if  the  implications  of  a  U.S.  LUA  were 
major.  Figure  4  suggests  a  somewhat  oversimplified 
logic. 

To  implement  this  logic  in  the  RSAC  system  one 
would  have  to  do  the  following: 

*  Ultimately,  we  hope  to  reflect  independent 
operations  by  commanders  at  levels  lower  than  the 
theater.  In  the  relative  near  term,  however,  we 
will  omit  such  considerations. 

**  This  has  management  implications  because 
it  suggests  that  we  should  invest  in  having  separate 
teams  develop  the  war  plans  for  the  individual 
theaters  rather  than  building  the  plans  from  a 
purely  top-down  perspective  that  would  tend  to  make 
the  analytic  plans  used  in  the  computer  model  come 
out  far  more  coordinated  and  mutually  reinforcing 
than  is  realistic.  Unfortunately,  developing  such 
separate  plans  is  manpower  and  expert  intensive, 
especially  because  of  the  effort  required  to  train 
teams  of  analysts. 
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Fig.  4— A  Simplified  Logic  Tree  for  Red's  Assessment  of 
Blue's  Ability  to  Launch  Under  Attack 

1.  Create  Red  war  plan  components  chat  would,  ii 
successful,  destroy  Blue’s  warning  satellites 
and  radars. 

2.  Build  Force  Agent  models  to  estimate  the 
effectiveness  of  such  Red  options  under  a 
variety  of  circumstances  (e.g.,  the  status  of 
Red's  antisatellite  systems  and  space-tracking 
network,  the  number  and  vulnerability  of  U.S 
satellites,  the  locations  of  Red's  SSBNs 
capable  of  attacking  U.S.  warning  radars). 

3.  Build  Red  decision  rules  sensitive  to  Red's 
assessment  of  Blue's  LUA  capability  (e.g., 
ruler  affecting  Red’s  willingness  to  launch  a 
first  strike  or  rules  affecting  the  rice  and 
nature  of  a  first  strike;  also,  rules  relating 
Red's  desire  to  prevent  U.S.  LUA  to  other  Red 
actions  that  would  provide  the  United  States 
with  strategic  warning), 

4.  Build  Red  rules  assessing  Bluo’s  LUA 
capability  (as  in  Fig.  4)  and  relating  the 
items  in  the  figure  to  the  war  plan  components 
mentioned  in  (1)  and  Blue's  probable  response 
to  other  Red  war  plan  components  as  mentioned 
in  (5). 

5.  Build  Blue  rules  sensitive  to  Indication  that 
warning  satellites  are  under  attack  (e.g.,  go 
on  highest  alert)  and  rules  establishing 
whether  Blue  would  actually  try  to  launch 
under  attack  under  some  circumstances  (a 
function  of  policy  and  capabilities,  which 
might  be  quite  different  from  those  assumed  by 
the  Soviet  Union,  whose  strategic  doctrine  has 
stressed  LUA  for  years). 

Although  this  is  only  a  narrative  sketch,  it  is 
sufficient  to  demonstrate  once  again  that  incorporating 
coramand-and-control  effects  is  an  inherently  complex 
business  .demanding  that  attention  be  paid  to  details  of 
scenario,  strategy,  the  two-sided  nature  of  the  game, 
etc. 

Where,  then,  do  we  stand  at  this  point  in  our 
development  program?  Is  this  all  conceptual,  or  are  we 
actually  implementing  the  ideas?  At  the  moment,  we  are 
within  a  few  months  of  automating  the  most  recent 
version  of  the  basic  RSAC  system,  having  conducted 
seoiautomated  experiments  last  summer. [13]  Once  the 
basic  system  is  operational,  we  plan  to  incorporate 
selected  command-and-control  effects  on  a  simplified 
basis  using  heuristic  rules  tied  to  grossly  defined 
world  states  (e.g.,  have  the  Soviets  already  attacked 
warning  satellites?).  We  then  expect  to  implement  a 
more  ambitious  but  still  first-generation  of  the 
overall  architecture,  probably  in  November  or  December, 
1983.  Finally,  we  expect  to  build  more  sophistication 
into  the  system  over  the  period  of  several  yeais-- 
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including  explicit  tie-ins  to  the  results  of  detailed 
models  such  as  those  used  to  estimate  connectivity  to 
bombers  as  a  function  of  weapon  lay-down  and  scenario. 
We  plan  to  use  structured  human  gaming  as  a  source  of 
insight  and  rules.  Our  expectation  is  that 
applications  will  be  possible  early  next  year,  well 
before  we  have  much  sophistlcation--priraarily  because  a 
major  contribution  of  the  effort  will  be  a  war  game 
framework  requiring  consistency  from  move  to  move  and 
requiring  the  human  or  automated  players  to  take  first- 
3 

order  C  I  effects  into  account  when  developing  their 
overall  strategies. 
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SECTION  1 


V  INTRODUCTION 

f~~ ITT  this  paper  we  describes  and  compare* 
two  modal*  of  air  defense  symtams  which 
focus  on  tha  command  and  control  ''  ?  ... 

aspacts  of  tha  air  dafansa  mission.  Tha 
modal s  ara  both  computar-basad 
raprasantations  of  how,  through  tha 
execution  of  C-^fi  functlors,  a  defansiva 
forca  datacts  and  dastrcys  an  offansiva 
forca  of  alrborna  penetrutors.  However  tha 
two  modal s  ara  complataly  different  in  tha 
manner  in  which  this  engagement  is 
raprasanted.  Tha  first  modal,  called  QUEB 
for  “^queuing  based,  ^consists  of  a  large  sat 
of  simultaneous  aquations  derived  using 
queuing  theory  which  solve  for  tha  dynamic 
steady-state  conditions  existing  throughout 
tha  system.  Tha  second  modal,  called  TADZ 
f  or  ^**tr  ansi  ant  air  defense  zone,**'ls  a 
so-called  event-stappad  simulation  of  tha 
air  dafansa  engagement.  ^ _ _ 


Wa  first  describe  an  underlying 
skeletal  or  conceptual  approach  to  modeling 
command  and  control  I  this  skeletal  approach 
is  reflected  In  both  OUEB  and  TADZ.  We  than 
describe  each  model  in  turn.  Finally, 
comparisons  are  drawn  between  tha  two 
approaches  to  modeling  the  same  problem. 

The  contribution  of  our  work  is _ 

two-fold.  First,  the  underlying  approach  to 
modeling  C"2  processes  wa  believe  to  be 
generic  and  applicable  to  other  command  and 
control  missions.  Second,  the  illustration 
of  the  generic  approach  in  terms  of  two 
specific  models  which  have  actually  been 
implemented  and  run  gives  Insight  into  the 
spectrum  of  conclusions  one  can  draw  (using 
different  sorts  of  models)  about  how  command 
and  control  contributes  to  thw  achievement 
of  a  military  mission. 


SECTION  2 

C'“2  SKELETAL  MODEL  AND  MODiTL  ELEMENTS 


objective  (offense).  For  example,  in  air 
defense  the  problem  is  to  detect  threats  and 
assign  weapons  to  shoot  them  down  prior  to 
their  carrying  out  their  assigned  attack. 

The  scarce  resources  in  the  air  defense 
environment  include  the  obvious  ones  such  as 
f ighter-interceptor*  (FIs)  or  surface-to-air 
missiles  (SAMS).  Other  resources  may  be 
more  subtle,  such  as  "attention  slices"  that 
radar  operators  must  devote  to  penetrators 
to  recognize  their  presence.  Identify  them, 
and  generate  reports  for  subsequent 
processing  by  the  system. 

Systems  characterized  by  competing 
demands  upon  scarce  resources,  each  of  which 
is  occupied  for  a  finite  time  while  it  is 
being  used,  are  modeled  mathematically 
within  the  discipline  of  queuing  theory  Cl 3. 
We  therefore  argue  that  a  queuing-based 
formulation  of  command  and  control  is  an 
appropriate  one.  We  shall  illustrate  such  a 
formulation  below. 

”  A  second  aspect  of  command  and  control 
that  must  be  captured  by  any  model  is  that 
C"2  systems  are  feedback  (or  "closed-loop," 
or  "cybernetic")  systems.  This  is 
illustrated  in  Fig.  2-1.  The  generic  C*2 
system  consists  of  sensors  which  gather 
information,  decision  makers  who  act  on  this 
Information,  and  weapons  systems  which  are 
directed  on  the  basis  of  commands  Issued  by 
the  decision  maker*.  All  this  is  tied 
together  by  a  communications  network  of  some 
sort  which  passes  the  information  around. 

(In  our  view,  some  analysts  mistakenly 
consider  the  communication*  network  alone  to 
be  the  ”0*2  system.”)  The  system  is  closed 
loop  because  the  sensors  see  what  happens! 

The  feedback  naturs  of  the  system  is 
critical  to  its  evaluation  because  the  load 
on  the  system  is  governed  by  the  need  to 
re-engage  targets  which  are  not  successfully 
destroyed  the  first  time  they  are  processed. 
It  is  obvious,  then,  that  model*  of  C"2 
systems  must  include  models  of  the  sensors, 
decision  makors,  and  weapons  associated  with 
a  mission,  as  well  as  a  model  of  how  they 
are  tied  together. 


2.1  Basic  Approach 

Command  and  control  is,  in  some  sense, 
the  "glue"  tnat  holds  together  a  complex 
collection  of  military  hardware  and  people 
trying  to  use  it  for  a  purpose.  To  achieve 
a  military  mission,  the  typical  problem  is 
to  apply  scarce  resources  in  a  timely  manner 
to  meet  a  threat  (defense)  or  achieve  an 


In  our  view,  the  appropriate  measures 
of  effectiveness  (MOEs)  for  characterizing 
C^Z  relate  to  time,  leakage,  and  resource 
utilization.  How  long  does  it  take  to 
prosecute  the  average  target?  How  many 
targets  escape?  How  often  are  there  no  FIs 
available  to  engage  known  threats,  or  how 
often  are  all  radar  operators  busy,  causing 
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new  t argots  to  go  undotsctsd?  Queuing 
formulations  deal  precisely  with  these 
questions,  as  we  shall  show  below. 


2.2  C~2  Element  flodels 


Given  the  structure  described  above,  the 
analyst  must  determine  the  degree  of  detail 
needed  in  the  individual  element  models 
inserted  into  the  structure.  Our  objective 
in  the  work  reported  here  was  to  Include 
enough  detail  to  address  the  questions  for 
which  the  model  was  intended,  yet  as  little 
detail  as  passible  about  sensors,  decison 
makers  (Oils) ,.  and  weapons.  An  explicit 
objective  of  the  model  design  was  to 
maximize  execution  speed,  and  one  means  of 
achieving  this  was  to  include  only  those 
model  details  actually  needed. 

SENIORS 

The  sensors  in  our  models  are 
represented  by  single-look  probabilities  of 
detection.  This  parameter  may  be  Influenced 
by  geography  and  relative  sensor /target 
location,  e.g.,  in  terms  of  terrain  masking. 
It  may  also  be  influenced  in  some  cases  by 
offensive  action  such  as  jamming.  The 
analyst  using  the  model  is  able  to 
implicitly  describe  detection  probability  in 
terms  of  physical  quantities  such  as  radar 
power  and  target  cross  section,  which  are 
then  passed  through  the  radar  equation  to 
determine  detection  probability.  However 
all  of  these  physical  parameters  influence 
the  C^2  computations  through  the  one 
probability  numbar. 

The  only  sensors  we  have  modeled  to 
date  are  radars  of  various  types,  both 
ground  based  and  airborne. 

WEAPONS 

As  with  the  sensors,  we  wished  to 
minimize  detail  in  our  weapons  models. 

Thus,  we  chose  to  represent  weapons  in  terms 
of  one-on-one  probability  of  kill.  Other 
parameters  associated  with  weapons  in  more 
detailed  analyses  of  weapons  effects,  such 
as  availability,  for  example,  are  "rolled 
into"  the  single  engagement  kill 
probability.  Multiple  engagements  of  a 
target  by  one  or  more  defensive  assets  are 
characterized  by  the  appropriate  chaining  of 
one-on-one  kill  probabilities. 

The  number  of  weapons  of  a  given  type 
and  their  geog.  aphic  location  are  explicitly 
-epresented  in  the  models;  it  is  possible 
for  all  weapons  in  a  region  to  be  "busy" 
prosecuting  penetrators  with  the  result  that 
subsequent  penetrators  pass  through 
unengaged. 

DECISION  MAKERS 

We  draw  a  distinction  between  decision 
making  tasks  which  are  "easy"  and  those  that 
are  "difficult."  Easy  tasks  are  those  whose 
execution  can  be  represented  merely  in  terms 
of  a  delay  or  in  terms  of  an  algorithm  in 
combination  with  a  delay,  the  algorithm 
specifying  which  of  several  outcomes  results 


under  given  conditions.  Difficult  tasks  are 
those  requiring  "intelligence"  or  “insight" 
to  carry  out. 

In  our  models  to  date,  we  have 
represented  only  easy  tasks,  i.e. ,  DMs  are 
represented  as  delays  and  algorithms.  We 
have  not  yet  considered  model  applications 
requiring  modeling  of  cognitive  processing 
or  "intelligent"  behavior. 

INFORMATION  NETWORK 

The  information  network  we  have  modeled 
to  date  is  considered  to  be  divided  into  a 
surveillance  side  and  an  airspace  control 
ali  e,  as  shown  in  Fig  2-2.  This  division  Is 
f o  ■  conceptual  clarity  only!  in  reality, 
surveillance  side  and  control  side 
installations  in  an  actual  air  defense 
system  may  be  physically  colocated.  Both 
surveillance  <“S-side“>  and  airspace  control 
("C-side")  processing  is  represented  as  a 
three-level  hierarchy.  SOa  are  the  basic 
target  detection  and  report  generation 
units.  These  report  to  Sis,  which  are 
fusion  centers,  themselves  reporting  to  a 
single  S2  which  carries  out  global  data 
fusion  for  the  air  defense  system.  (Some 
SOs  may  report  directly  to  S2  if  desired.* 

Qn  the  control  side, ;  a  single  C2  divides 
target  assignments  among  several  Cls,  which 
further  subdivide  them  among  individual  CO* 
which  represent  SAM  Or  FI  units.  There  may 
or  may  not  be  cross-links  from  SOs  to  CO*  or 
from  Sis  to  Clsl  the  presence  or  absence  of 
such  links  results  in  affects  which  we  wish 
to  study  with  our  models,  as  we  shall  show 
bel ow.  j 

In  this  section',  we  have  described  the 
generic  features  of  our  approach  to  C~2 
modeling.  We  now  describe  how  thesa  are 
reflected  in  the  two1  specific  models 
constructed  to  date.i 

j 

SECTION  3 

QUEU INS-BASED  MODEL  (QUEB) 

In  this  section,  we  will  describe  the 
features  of  the  QUE3  model,  its  formulation, 
and  the  computations  carried  out  when  it  is 
executed.  This  model  is  documented  more 
completely  in  references  C23-C41. 

3. 1  Model  Features 

The  air  defense  engagement  modeled  by 
QUEB  consists  of  the  interaction  of  a 
homogeneous  threat  with  a  mixed  defense. 

That  is,  the  threat  is  assumed  to  consist  of 
a  single  type  of  penetrator,  say  B-52s  or 
cruise  missiles,  while  the  defense  is 
assumed  to  include  both  SAMs  and  FIs 
(assuming  the  model  is  used  to  represent  a 
US  attack  on  the  Soviet  Union.)  The  threat 
is  assumed  to  traverse  a  number  of  discrete 
penetration  paths  which  are  fixed  a  priori 
but  which  are  unknown  to  the  defense. 
Similarly,  the  ground  based  defense 
deployment  is  fixed  throughout  the 
engagement.  (Airborne  defense  elements  move 
about  based  on  their  assignments  to 
targets.)  The  penetrator  routes  may  be 
chosen  by  the  model’s  user  to  avoid  defense 
assets  if  desired. 
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The  information  flow  represented  is 
that  illustrated  in  Fig.  2-2,  however  the 
cross-links  from  SOs  to  COs  and  from  Sis  to 
Cls  are  absent.  Thus  the  information  flow 
is  assumed  centralized  in  that  all  target 
reports  pass  throuah  S2/C2.  Message 
generation  on  the  surveillance  side  is 
probabi 1 i st i c .  representing  non-perfect 
single-look  detection  capability,  however 
the  detection  model  is  a  simple 
"cookie-cutter "  approach.  That  is,  each  SO 
is  assumed  to  "cover"  a  particular 
geographic  region  associated  with  which  is  a 
single  probability  of  detection  per  look 
number.  Radars  can  achieve  a  number  of 
looks  governed  by  the  penetrator  speed,  the 
radar  sweep  rate,  and  the  penetrator  route 
relative  to  the  radar  position. 

The  control  side  represents  several 
types  of  threat  prosecution,  depending  on 
whether  the  particular  unit  carrying  out  an 
engagement  is  a  SAM,  a  single  FI,  or  a  group 
of  FIs  < a  "flight").  As  target  assignments 
oercolate  downward  from  C2  to  Cls  to  COs, 
the  messages  are  handled  differently 
depending  on  the  type  of  defensive  asset 
involved.  For  example,  FIs  are  assumed  to 
have  on-board  radars  (of  characteristics 
specified  by  the  model  user)  which  must 
acguire  the  target  before  the  FI  can  engage 
it.  SAMs,  on  the  other  hand,  must  be 
directed  by  ground-based  fire  control 
radars.  Flights  of  FIs  can  be  represented 
as  jointly  prosecuting  targets. 

In  the  3UEP  model,  the  DMs  are  modeled 
simply  as  delays.  This  is  adequate  since 
alternative  actions  on  the  part  of  the  DMs 
are  not  modeled:  it  is  clear  from  the 
information  network  model  topology  where  all 
information  goes.  The  DMs  have  only  to 
recognize  the  presence  of  targets  (DMs  at 
SOs),  carry  out  fusion  (DMs  at  Sis  and  S2) , 
and  make  increasingly  detailed  target 
assignments  (DMs  at  C2,  Cls,  and  COs). 

While  these  are  perhaps  stressing 
assignments  in  real  life  and  require  some 
"thought,"  it  is  adequate  to  represent  them 
merely  as  delays  in  QUEB  because  the  outcome 
IN  TERMS  OF  WHO  DOES  WHAT  NEXT  is  highly 
constrained  —  although  tasget  assignments 
on  the  control  side  are  reactive  to 
engagement  conditions,  the  fundemental 
information  flow  pattern  SC-S1-S2-C2-C1-C0 
is  fixed.  (Looking  ahead,  this  is  not  the 
case  in  TADZ. ) 

The  C~'2  system  structure  is  fixed  in 
any  given  QUEB  run.  That  is,  neither  the 
topology  of  the  information  network  nor  the 
character i sti cs  of  the  system  elements 
changes  durinq  an  execution  of  the  model. 

3.2  QUEB  Formulation 

QUEB  is  a  set  of  many  simultaneous 
equations,  derived  from  queuing  theory. Cl], 
which  are  iteratively  solved  as  the  model  is 
executed.  Queuing  theory  deals  generically 
with  the  set-up  depicted  in  Fig  3-1.  Here 
"customers"  arrive  into  a  queuing  module, 
which  attempts  to  place  "servers"  at  their 
disposal  to  carry  out  some  task.  A  customer 
which  arrives  when  all  servers  (of  which 
there  may  be  one  or  more)  are  busy  waits  in 


a  "queuing  facility"  until  either  a  server 
becomes  free  to  attend  to  it  or  the  customer 
gives  up  and  leaves  the  module  (in  which 
care  it  is  said  to  "renege").  These  terms 
and  concepts  become  intuitively  clear  if  the 
reader  considers  entering  a  fast-food 
restaurant  seeking  a  meal. 

Queuing  theory  permits  an  analyst  to 
compute  several  statistics  about  the  set-up 
depicted  in  Fig.  3-1.  Riven  statistics 
characterizing  the  arrival  of  customers, 
statistics  characterizing  the  time  it  takes 
to  process  each  one,  and  the  number  of 
servers,  the  theory  provides  equations 
describing  the  statistics  of  the  waiting 
time  that  passes  until  a  server  is  available 
to  process  a  customer,  the  statistics  of 
reneging  (l.e  ,  how  often  customers  leave 
the  module  without  being  served),  and  the 
statistics  of  server  utilization  (i.e. ,  how 
often  everyone  is  busy.)  Examples  of  such 
equations  are  not  given  here  due  to  space 
limitations;  the  interested  reader  should 
see  C13-C31. 

Within  the  QUEB  model,  the  elements  of 
the  C'2  system  are  identified  with  the 
queuing  concept  of  a  server,  and  the 
penetrators  and  messages  abnut  them  are 
identified  as  customers.  Thus,  for  example, 
any  SO  is  considered  to  be  a  server  which  is 
trying  to  service  a  penetrator  by  detecting 
it  and  generating  a  message  which  is  a 
target  report.  Tha  operators  at  the  Sis  are 
trying  to  service  the  incoming  target 
reports  by  fusing  them,  l.e.,  by  determining 
which  reports  from  separate  SO  sites 
represent  in  fact  the  same  physical  target. 
The  individual  threat  prosecution  assets 
(FIs  and  SAMS)  represented  at  the  CO  level 
are  attempting  to  serve  the  penetrators  by 
locking  their  local  sensors  onto  them  and 
destroying  them.  Penetrators  (again 
represented  as  customers)  are,  from  the 
viewpoint  of  their  mission,  only  successful 
if  the  defense  fails  to  complete  their 
service  before  they  leave  the  airspace  of 
the  air  defense  system.  Thus,  the 
penetrators  that  leak  through  the  defense 
are,  in  queuing  terminology,  the  renegers. 

The  overall  structure  of  the  QUEB  model 
is  shown  in  Fig.  3-2.  There  is  a  set  of 
equations  representing  the  surveillance  side 
of  the  system  which  calculate  the  statistics 
of  message  generation.  There  are  equations 
representing  the  resource  allocation  modules 
which  compute  the  statistics  of  order 
generation.  Finally,  equations  representing 
the  threat  prosecution  modules  compute  the 
statistics  of  as  yet  undestroyed  targets. 
These  last  feed  back  to  the  surveillance 
modules  as  shown  in  the  figure.  Equations 
associated  with  all  of  the  modules  represent 
thB  overall  measures  of  performance  and 
effectiveness  associated  with  the  system. 

3.3  QUEB  Computations 

As  stated  above,  QUEB  consists  of  a 
large  set  of  simultaneous  nonlinear 
equations  from  queuing  theory.  These  are 
solved  iteratively  to  yield  the  dynamic 
steady  state  statistics  throughout  the 
system.  These  statistics  can  be  interpreted 
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am  thm  system  lmakagm,  thm  percentage  of 
asset  utilization,  ate. 


SECTION  4 

TRANSIENT  AIR  DEFENSE  ZONE  MODEL  <TADZ) 

In  this  maction  wa  Mill  describe  tha 
TADZ  modal  in  a  mannar  parallal  to  tha 
discussion  of  QUEB  abova:  wa  will  covar  tha 
modal 't  features,  its  f  ormulation,  and  tha 
computations  nacassary  to  execute  it.  TADZ 
is  documanted  in  datail  in  references  CS] 
and  E61. 

4. 1  Modal  Faaturas 

Tha  primary  motivation  bahind  tha 
design  of  TADZ,  which  was  tha  sacond  air 
defense  modal  wo  dovaloped,  wau  to  provida  a 
capability  to  explicitly  datarmina  the 
transient  behavior  of  tha  air  defense  C~2 
system.  In  addition  to  this  desire,  which 
wa  satisfied  by  going  to  an  event-stepped 
simulation  as  we  discuss  below  in  Section 
4.2,  there  were  several  mission  faaturas  we 
wanted  TADZ  to  deal  with  which  were  absent 
in  QUEB.  (QUEB  could  be  extended  to  include 
one  or  more  of  these  features  if  desired! 
see  Section  S. ) 

Tha  first  feature  desired  was  an 
ability  to  deal  with  a  mixed  threat,  i.a. , 
one  consisting  of  several  types  of 
penetrators  (bombers,  cruise  missiles,  ate.) 
arriving  simultaneously.  This  feature  would 
allow  study  of  tha  Impact  of  different 
engagement  priority  doctrines  as  executed  by 
the  C~2  system.  Coupled  with  this  was  a 
daslre  to  model  a  flexible,  adaptive  C^2 
structure  which  would  permit  information 
routing  to  be  responsive  to  the  proqrass  of 
tha  engagement.  In  contrast  to  QUEB,  which 
has  a  fixed,  centralized  C"2  structure,  TADZ 
allows  the  structure  to  vary  during  a  run 
both  because  the  defense  may  choose  to 
change  the  routing  of  messages  and  because 
the  offense  can  destroy  elements  of  the  C~2 
system.  The  C~2  system  adaptivity  is 
manifested  by  its  ability  to  react  to  the 
destruction  of  defense  elements  and  continue 
the  engagement  using  a  changed  C^2 
structure. 

Certain  additional  mission  elements 
were  Included  in  TADZ  beyond  those  present 
in  QUEB.  These  include  ECN  applied  by  the 
offense,  defense  suppression  (as  just 
mentioned),  terrain  masking  of  the  defense 
radars  and  exploitation  of  terrain  masking 
by  the  offense,  and  resupply  of  exhausted 
defense  elements  such  as  FIs  which  have  used 
up  their  fuel  and  SAM  launchers  having  no 
more  missiles. 

The  DMs  in  TADZ  are  represented  both  as 
delays  and  as  algorithms.  The  algorithm 
representation  is  needed  because  in  TADZ  the 
outcome  of  their  decisions  IN  TERMS  OF  WHO 
DOES  WHAT  NEXT  is  not  as  rigidly  fixed  as  it 
is  in  QUEBl  operators  on  the  S-side  of  the 
system  can  decide  where  to  route  messages, 
either  upward  through  the  nominal 
centralized  chain  of  command  or  across  to 
the  C-side  directly.  The  C~2  adaptivity 


-ceptured  in  TADZ  requires  that  the  DMs 
decide  where  to  send  messages  as  a  function 
of  tha  engagement  conditions  as  interpreted 
locally.  There  are  different  classes  of 
messages  represented  in  TADZ,  for  example 
classes  representing  targets  of  different 
priority,  which  are  processed  differently  as 
governed  by  algorithms  representing  C~2 
doctrine. 

The  information  ratwork  in  TADZ  is  that 
shown  in  Fig.  2-3  including  the  potential 
cross-links  from  SOs  and  COs  and  those  from 
Sis  to  Cls.  The  DMs  in  the  system  determine 
which  links  to  route  Information  over  as 
described  in  the  proceeding  paragraph. 

4.2  TADZ  Formulation 

Queuing  theory  does  provida  techniques 
for  transient  analysis  through  computing  the 
time  evolution  of  the  probability 
distributions  governing  the  statistics  of 
the  variables  in  the  system.  However  these 
Involve  solving  partial  differential 
equations.  In  our  judgement,  the 
computational  effort  required  to  do  this  was 
sufficiently  greater  than  that  required  for 
solving  the  QUEB  simultaneous  algebraic 
eauatlons  that  we  preferred  to  examine 
alternative  approaches.  Alternative 
approaches  were  also  considered  because  the 
project  supporting  tne  work  reported  here  is 
explicitly  directed  at  investigating  diverse 
techniques  for  analyzing  C~2.  A  second 
model  of  a  second  type  was  requested  by  the 
sponsor . 

TADZ  is  an  event -stepped  Monte  Carlo 
simulation.  The  C^2  elements  and  their 
interconnection  are  represented  as  nodes  in 
a  network,  each  node  being  represented  as  a 
queuing  module  as  shown  in  Fig.  4-1.  Any 
given  node  is  preceded  by  others  (or  by  a 
target  generation  process)  and  followed  by 
others  (or  by  targets  exiting  from  the 
controlled  airspace).  As  indicated  in  Fig. 
4-1,  messages  or  targets  arrive  and  are 
Interpreted  as  customers  in  a  queuing  sense. 
Activities  are  carried  out  at  the  node  and 
messages  are  sent  to  subsequent  nodes,  as 
appropriate,  i.e.,  the  customers  are  served. 
In  the  course  of  service,  events  are 
defined,  for  example,  target  enters  radar 
coverage,  target  is  detected,  SAM  is 
assigned,  target  survives  engagement,  etc. 
Events  signal  the  start  or  completion  of 
various  C~2  processes,  and  the  time 
necessary  to  execute  these  processes  is 
either  supplied  as  an  input  to  the 
simulation  by  the  user  or  computed  according 
to  a  model  embedded  in  it.  The  simulation 
advances  time  as  event  after  event  occurs) 
this  is  the  sense  in  which  it  is 
event-stepped.  In  other  words,  simulated 
time  does  not  progress  in  uniform 
increments. 

As  the  status  of  each  node  in  the 
network  (as  represented  generlcally  in  Fig. 
4-1)  is  updated  as  a  result  of  events, 
statistics  are  collected  for  later  display. 
Examples  of  these  statistics  are  the  number 
of  busy  operators  at  each  node,  the  current 
backlog  in  the  many  queues,  the  time  taken 
to  process  messages  (for  those  having 
stochastic  times),  etc. 


Am  stated,  the  simulation  im  a  Mont* 
Carlo  model.  Stochastic  effects  rmprmmmntmd 
in  the  model  over  which  the  Monte  Carlo 
samples  can  be  averaged  include  the  arrival 
times  of  the  penetrators,  the  outcomes  of 
detection  opportunities,  the  outcomes  of 
one-on-one  engagements,  and  the  time 
necessary  to  carry  out  various  decision 
processes  as  well  as  others.  Deterministic 
runs  can  be  obtained  by  "turning  off  the 
noises. " 

4.3  TAOZ  Implementation 

Monte  Carlo  simulations  of  C~2  systems 
can  be  very  time  consuming  to  execute.  We 
developed  a  design  for  TADZ  which  keeps  as 
much  calculation  «s  possible  outside  the 
Monte  Carlo  loop  so  that  it  is  only  done 
once. 

The  simulation's  components  are  shown 
in  Fig.  4-2.  Input  and  output  processors 
communicate  with  the  user.  An  event 
processor  executes  outside  the  Monte  Carlo 
loop  to  set  up  the  potential  events  that 
might  occur  during  the  engagement,  l.e., 
these  are  tentatively  scheduled.  The 
concept  is  illustrated  in  Fig.  4-3,  which 
depicts  a  typical  penetration  route  passing 
through  a  region  of  surveillance  coverage 
and  two  "lethality  regions"  within  which 
penetrators  are  vulnerable  to  SAM  attack. 
Based  on  the  penetrator  spoed,  the  nominal 
times  when  penetrators  would  cross  the 
boundaries  of  these  regions  are  computed  and 
tentatively  scheduled.  Whether  these  events 
"actually"  occur  remains  tc  be  seen. 

After  the  preprocessing  just  described 
is  completed,  the  cyclical  calculations  are 
then  carried  out  by  the  network  processor 
using  physical  subsystem  models  and 
information  flow  models  to  compute  the 
outcomes  of  many  random  events.  Some  of 
these  modify  the  precomputed  event  schedule 
so  that  it  reflects  the  experience  of  a 
single  Monte  Carlo  run  as  it  executes.  In 
this  sense,  some  events  that  were 
prescheduled  do  not  "actually"  occur.  Each 
pass  through  the  Monte  Carlo  loop  begins 
with  the  same  tentative  event  schedule, 
however  each  pass  through  the  loop  typically 
completes  with  a  unique  tableau  of  events 
which  actually  took  place. 


SECTION  5 

COMPARISON  AND  CONCLUSIONS 

Two  models  of  air  defense  command  and 
control,  both  built  using  an  underlying 
conceptual  framework  for  viewing  command  and 
control  problems,  have  been  described  in 
this  paper.  Both  have  been  implemented,  and 
the  results  obtained  using  them  have  been 
compared. 

Specific  numerical  results  obtained 
with  these  models  cannot  be  given  in  an 
uhclassif led  publication.  However  it  is 
useful  to  contrast  the  characters  of  the  two 
"odels.  The  QUEB  model  is  rn  analytic  model 
in  the  sense  that  its  execution  involves  the 
solution  of  a  set  of  simultaneous  equations. 

It  is  Implemented  in  about  24000  lines  of 

HeL- 


PASCAL  code  and  runs  in  a  couple  of  seconds 
on  a  VAX  11/750.  We  have  used  it  to  study 
the  impact  on  C~2  effectiveness  of  various 
technology  options.  We  have  seen,  for 
example,  that  several  typical  surveillance 
and  threat  prosecution  upgrades  do  not  have 
a  significant  effect  on  an  overall  measure 
of  effectiveness  such  as  "average  time 
between  penetrator  entry  and  its  first 
encounter  with  a  defender."  It  is  clear  why 
this  is  so  when  the  numbers  are  examined  — 
in  short,  geographic  and  geometric  effects 
dominate  this  MOE  under  the  conditions 
assumed  (and  not  necessarily  under  all 
conditions).  However  the  system  upgrades 
studied  do  have  a  significant  effect  on  the 
pattern  of  loading  imposed  on  the  C~2 
assets.  For  example,  better  surveillance 
leads  generally  to  earlier  detection  with  a 
concomitant  increase  in  load  on  elements  at 
the  "front"  of  the  air  defense  system,  i.e. , 
those  along  the  edge  over  which  the 
penetrators  enter. 

The  TADZ  model  is  an  event-stepped 
simulation  Implemented  in  about  75000  lines 
of  FORTRAN.  The  higher-level  simulation 
language  SLAM  has  also  been  used  in  the 
implementation  of  TADZ.  It  takes  about 
15-30  minutes  to  execute  a  typical  scenario 
using  a  nominal  number  of  Monte  Carlo 
iterations,  e.g.,  10.  Thus  TADZ  is  as  much 
as  1000  times  slower  than  QUEB.  TADZ  does 
provide  much  more  detailed  information, 
however.  It  shows  the  analyst  the  impact  of 
flexible  C~2  doctrine  and  how  the  C~2  system 
reacts  to  perturbations  such  an  destruction 
of  Important  C~2  centers  during  an 
engagement.  When  factors  represented  both 
in  QUEB  and  TADZ  are  compared,  the  two 
models  have  been  found  to  give  similar 
results. 

We  have  concluded  from  our  studies  that 
alternative  and  complementary  models  of  C~2 
can  be  useful  for  studying  various  questions 
about  C~2  systems.  We  have  found  that 
valuable  insight  can  be  gained  about  C~2 
MQEs  using  models  having  minimal  detail  in 
them  about  sensors,  weapons,  and  decision 
makers!  it  is  the  interconnection  and 
interaction  of  these  elements  that  matters 
most  for  some  studies.  Generally,  the 
obvious  is  true:  more  detail  causes  a  slower 
running  model.  The  analyst  must  determine 
if  the  detail  to  be  Included  is  really 
needed  for  his  purposes,  or  whether  he  is 
succumL.ng  to  the  temptation  to  put  it  in 
because  "more  must  be  better."  We  point  out 
that  detail  does  not  in  itself  imply  that  a 
simulation  model  is  required  in  place  of  an 
analytic  one.  As  we  said  in  Section  4, 
several  of  the  features  included  in  TADZ 
that  are  absent  in  QUEB  could  have  been 
added  to  0UEB  if  desired. 

Finally,  we  have  convinced  ourselves 
that  the  skeletal  framework  for  modeling  C*2 
described  in  Section  2  is  a  generally  useful 
one  applicable  to  more  than  air  defense  C~2 
analysis. 
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LANCASTER'S  EQUATIONS  AND  GAME  THEORY 
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.  ....  _  ""  SUMMARY 

v  Generalization  of  Lanchester* s  Equations  leads  to  a 
model  of  combat  In  which  different  types  of  Blue  forces 
(say  Infantry,  artillery,  armor)  suffer  attrition  due 
both  to  aimed  and  area  fire  from  different  types  of 
Orange  forces.  The  attrition  coefficients  depend  on  the 
types  of  forces  and  the  conditions  of  combat.,  The  prob¬ 
lem  considered  here  concerns  optimization  of  the  force 

distributions  x  •  (xj,  . .  xj)  and  y  *  (yi, 

Y2»  ••••  yjh—SuWect  to  constraints  on  the  aggre¬ 
gate  forces  (52  Xf  and  3£yi)  and  given  the  set  of 
attrition  coefficients.  It  Is  shown  that  choosing  the 
objective  function  as  the  difference  of  the  aggregate 
attrition  rates  leads  to  an  optimization  problem  that  is 
mathematically  equivalent  to  a  matrix  game.  It  follows 
that  the  combat  mod  el  has  a  saddle-point  solution,  with 
optimum  force  distribution  vectors  x*  and  y*  which  can 
be  determined  by  a  1 1near  program. 

The  remainder  of  the  paper  Investigates  the 
neighborhood  stability  of  the  model  at  the  opt  1mm 
operating  point  (x*.  y*).  We  define  and  calculate  two 
parameters.  It}  and  k;,  which  partially  characterize 
stability.  A  two  x  two  example  Is  presented  along  with 
sample  unstable  phase  trajectories. 

Comnand  and  Control  decisions  have  great  Influence 
on  the  constraints  governing  the  feasible  set  of 
different  force  compositions  which  can  be  brought  to 
bear  In  a  battle.  It  Is  hoped  that  better  understanding 
of  how  these  constraints  affect  the  (optimum)  battle 
outcome  will  provide  Insight  Into  the  quantitative 
Impact  of  command  and  control  Information  systems. 
Substantial  additional  work  generalizing  and 
Interpreting  the  results  to  date  will  be  necessary, 
however,  before  this  becomes  possible.  ^ _ 
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Additional  motivation  arises  out  of  the  need  for 
»„  ,  Interactive  wargaming  as  a  vehicle  for  developing  new 

ni  command  and  control  doctrine.  Even  In  a  "distributed" 

.  game, where  most  players  can  participate  In  situ.  It  will 

attrition  aue  be  MCeiSary  to  use  player  surrogates  and  to  aggregate 

erent  types  of  com[,at  results  from  one  level  of  resolution  to  another, 

s  depend  on  the  Hopefully,  the  models  studied  here  will  provide  Insight 

ibat.  The  prob-  both  1ntQ  how  t0  aggregate  meaningfully,  and  how  to 

,n  the  f?2ce  design  plausible  surrogates. 


LANCHESTER' S  EQUATIONS  AND  GAME  THEORY 


Background 


_ Consider  a  battlefield  and  assume  that  Orange  and 

Blue  are  opponents.  Each  has  several  different  types  of 
forces  available  -  say  Infantry,  artillery  and  armor  - 
and  the  total  forces  available  to  each  side  are  X  and  Y, 
respectively.  Our  problem  Is  to  determine  an  appro¬ 
priate  distribution  of  forces  for  each  side,  and  \.o 
evaluate  the  cost  of  doing  battle. 

A  more  elaborate  problem  of  the  same  genre  would 
Involve  two  or  more  battlefields,  with  Individual  con¬ 
straints  on  the  total  strength  of  each  type  of  force. 
We  again  wish  to  apportion  the  forces  (across  the  bat¬ 
tlefields)  and  evaluate  the  cost  of  battle. 

Motivation  to  study  problems  of  this  sort  stems 
from  a  belief  that  the  development  of  a  theory  of  Com¬ 
mand  and  Control  requires  that  we  be  able  to  quantify 
the  effect  cf  combat  Information  processes  on  battle 
outcome.  These  Information  processes  feed  decisions 
which  In  due  course  constrain  the  set  of  feasible  force 
distributions.  A  measure  of  the  utility  of  Information 
might  therefore  be  taken  as  the  change  In  the  "optimum" 
cost  of  battle  implied  by  a  change  In  the  constraints  cn 
what  forces  can  be  brought  to  bear. 


Problem  Formulation  and  Results 

In  order  to  model  the  combat  processes  we  adopt  the 
following  generalized  version  of  Lanchester's  Equa¬ 
tions:  [1] 
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the  Xf  and 
forces  of  dlf 
the  u^  and 
clents, 
the  a^j  and 
coefficients, 
the  b  4  4  and 


and  yj  represent  Blue  and  Orange 
different  types, 

and  vj  are  self-attrition  coeffl- 
and  cij  are  area-fire  attrition 


the  b^j  and  d^j  are  almed-flre  attri¬ 
tion  coefficients;  and 

the  rj  and  Sj  are  replenishment  coeffi¬ 

cients. 


We  constrain* 


Yi  >  0 


for  all  1,  J 


E'j 
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If  any  of  the  force  component  levels  Is  zero,  the  cor¬ 
responding  equation  Is  to  be  dropped  out  altogether;  It 


+It  Is  also  possible  to  introduce  constraints  on  the 
Individual  types  of  forces,  **en  more  than  one  combat 
site  Is  Involved.  We  could  also  con:ider  Jie 


weighted  constraint  X 


.•  • 


•  *.  ■-  *.  *  »  V  ■'  '.V*'.  ■*.  •* . 
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does  not  moke  physical  sense  to  assign  attrition  against 
non-existing  assets.  Eqs.  (1),  less  the  linear  terms, 
are  known  as  totka-Vol terra  equations.  They  have  been 
used  extensively  to  model  complex  ecosystems  In  which 
the  stability  of  a  given  population  distribution  Is  a 
central  question  f2]. 

The  question  of  an  optimum  force  distribution  for 
our  system  may  be  approached  by  choosing  an  objective 
function  (l.e.,  a  measure  of  effectiveness)  for  the 
combat  processes.  A  useful  choice  for  our  purposes  Is 
the  difference  of  the  weighted  sums  of  the  loss  rates: 

m  4  23vsi  -  ri>  -  -  v 

i  j 


where 

"ilv  s»Esj 

i  j 

If  the  combat  processes  remain  In  equll Ibrlum  for  T  time 
units,  the  Integrated  cost  Is 

TM*  (4) 

Whether  or  not  the  equll Ibrlun  can  be  maintained, 
of  course,  depends  on  vrfiether  or  not  the  operating  point 
(x*,  y*)  exhibits  neighborhood  stability.  To  Investi¬ 
gate  this  question,  we  linearize  Eq.  1  by  substituting 


For  simplicity,  we  consider  here  only  the  case  In  which 
all  the  weighting  coefficients  are  unity,  so  that 

h  "Z/Si  *  ri>  -  *  v 

i  j 

The  first  major  result  of  our  study  to  date  Is 
that,  for  this  choice  of  M,  there  exist  optlmun  force 
distribution  (row  and  column)  vectors  x*  and  y*  such 
that  for  any  other  vectors  x  and  y 

xAy*  <  M*  _<  x* Ay  (3) 


where 


H*  j  x*Ay* 
and 


A  Is  a  matrix  determined  by  the  attrition  coeffi¬ 
cients. 


X1  -  X1  +  *x1>  +  «>j 


and  discarding  terms  of  2nd  order.  The  result  Is  the 
linear  system 


L*yJ 

In  which  the 


a 


'confl  let 


matrix"  has  the  form 


The  elements  of  C  are  real  and  positive,  and  the  subma¬ 
trices  70  and  'c  are  d lagonal . 

It  Is  shown  In  Appendix  B  that  the  matrix  (f  exhib¬ 
its  the  property  that 


for  all  1 


This  result,  derived  In  Appendix  A,  Is  a  direct  conse¬ 
quence  of  the  fact  that  the  objective  function  M  can  be 
written  In  a  form  mathematically  equivalent  to  a  matrix 
game.  The  optimum  vectors  x*  and  y*  can  therefore  be 
determined  by  solving  a  linear  program.  It  follows  that 
fairly  large  problems  (say,  a  few  hundreds  of  variables) 
can  be  handled  with  a  reasonable  computation  load. 

A  difficulty  arises  with  the  problem  formulation, 
however,  when  the  optimal  vectors  x*  and  y*  have  zero 
components,  either  because  the  solution  Is  degenerate  or 
because  the  orders  of  x  and  y  are  different.  In  such 
cases,  the  objective  function  M  Is  defective.  In  that  it 
Incorporates  almed-flre  attrition  against  non-existent 
forces.  How  best  then  to  proceed  deserves  further 
study;  several  alternatives  (none  obviously  superior  to 
the  others)  come  readily  to  mind. 

A  second  difficulty  enters  because  of  the  fact  that 
Eq.  1  Is  non-linear,  and  can  In  general  only  be  Inte¬ 
grated  nunerically.  In  particular,  the  optimization  Is 
valid  only  at  the  Initial  Instant,  since  the  force  dis¬ 
tributions  will  usually  change  continuously  as  the  at¬ 
trition  processes  evolve  In  time. 

Dynamical  Considerations 

Variations  of  force  composition  can  be  suppressed 
by  the  artifice  of  choosing  the  replenishment  rates 
rj  and  .  sj  to  make 


and 
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from  which  It  follows  Immediately  that 


IX  -  kjlX  -  IT  -  k2ir 


(6) 
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are  the  perturbations  of  the  aggregated  forces  around 
the  operating  point  (x*.  y*). 

The  column- sum  parameters  kj  and  k;  tell  us  a 
great  deal  about  neighborhood  stability.  In  particular. 
It  is  shown  In  the  appendix  that  the  operating  point 
(x*.  y*)  Is  locally  stable  If  ki  and  k?  are  both 
negative.  If  ki  ■  k»  ■  k  >  0,  then  k  Is  the  largest 
eigenvalue  of  -C  and  the  system  Is  unstable.  We 
conjecture,  but  have  not  yet  proven,  that  for  0  <  kj  < 
k2»  -C  has  a  largest  real  eigenvalue  with  k.<x_,v 
<  V  If  true.  It  follows  from  Eq.  6  thStx  nBX 


IX/IY  <  0 


for  *1  <  Amax  <  k2 


(7) 


X1  *  f  "  1.  2 . U  i-l.  2 . J 

at  x  ■  x*  and  y  »  y*. 

Since  the  optimizing  vectors  x*  and  y*  are  unaf¬ 
fected  by  these  replenishment  terms,  the  operating  point 
(x*,  y*)  Is  then  In  equll  Ibrlun  and  the  cost  of  combat 
becomes 

(m  -  R-S  -  M*-  S-R 


so  that  departures  of  the  aggregated  forces  from  an  un¬ 
stable  optimum  equilibria?  will  be  ot  opposite  sign  such 
that  If  one  force  grows,  the  other  must  shrink.  The 
situation  with  kj  and  k2  of  opposite  signs  Is  more 
complicated  and  requires  additional  study,  but  we  an¬ 
ticipate  that  a  similar  phase  relationship  will  hold  for 
all  unstable  cases,  l.e.,  whenever  the  corresponding 
eigenvalue  Is  positive. 


Example 


The  three  aggregate  parameters  M*,  kj  and  k2 
summarize  the  interplay  within  a  rich  non-linear  dynamic 
system  of  great  complexity.  Even  the  2x2  case  in¬ 
volves  26  parameters,  all  of  vrfHch  can  in  general  be 
non-zero.  The  following  numerical  example  serves  to 
indicate  the  diversity  of  behavior  patterns  which  can 
ensue.  For  this  example,  the  bilinear  attrition  coef¬ 
ficients  are  given  by  the  matrices 


the  linear  attrition  coefficients  by 


and  the  sel f-afcrition  coefficients  by 

u  ■  [•>  -3]  •  ■  [.'!] 

The  results  of  optimization  are  tabulated  in  Table 
1,  in  which  for  convenience  we  have  held  the  XY  product 
constant  at  4.  We  note  that  the  operating  point  (x*. 
y*)  is  unstable  for  small  values  of  X,  and  stable  for 
large  values.  When  the  signs  of  kj  and  k2  are  the 
same,  xmax  also  has  that  sign.  But  for  mixed 
signs,  Xmax  may  be  +  or  -.  [A  conjecture  that  the 
largest  absolute  value  governs  in  such  cases  can  be  dis¬ 
proved  by  counterexample]. 

Finally,  we  note  that  the  cost  of  combat,  M*, 
favors  Orange  (e.g.,  y)  for  both  small  and  large  values 
of  X,  whereas  Blue  is  favoi  1  in  the  Intermediate  cases. 
Why  this  happens  becomes  clear  In  terms  of  the  Indjvld- 
ual  aggregate  loss  rates  X  aid  Y.  For  example,  I  X  |  is 
high  for  X  »  1/2  because  of  the  adverse  force  ratio.  As 
X  increases,  |  X  |  decreases  because  Y  decreases. 
Finally,  when  X  *  4  the  area  fire  losses  cause  |  X  |  to 
Increase  agains.  The  same  factors  affect  the  behavisr 
of  |  Y  |,  so  that  M*  is  the  difference  of  two  U-shaped 
curves.  It  is  nonetheless  reassuring  that  the  frac¬ 
tional  loss  rates,  |  X  |/X  and  )  Y  L/Y ,  behave  monoton- 
ically,  as  conventional  wisdom  would  predict. 

Figure  1  shows  two  unstable  phase  trajectories  for 
the  case  X  »  1,  Y  »  4.  These  were  obtained  by  per¬ 
turbing  the  optimum  system  from  its  equilibrium  point  by 
very  ;,nall  amounts  first  in  one,  and  then  in  the  op¬ 
posite,  direction  along  the  eigenvector  corresponding  to 
the  dominant  root  Xmax  «  .137.  In  the  first  case, 
indicated  by  dots  every  40  time  units,  the  system 
diverges  from  (x*,  y*)  with  X  increasing  and  Y  decreas¬ 
ing.  However,  because  of  the  nonlinearity  of  the  gener¬ 
al  ized  lanchester  equations,  there  are  multiple  equilib¬ 
rium  points.  In  this  example,  one  of  these  is  at  ( xi 
»  1.77,  x2  *  1.2862)  and  (yt  *  1.667,  y2  •  .352) 
corresponding  to  X  -  3.056,  Y  *  1.419.  Furthermore, 
this  equilibrium  point  Is  stable  and  the  system  equal¬ 
izes  itself  there,  far  from  the  original  operating 
point.  Here  even  though  Blue  gains  ..he  initial  advan¬ 
tage,  he  is  unaoie  to  completely  eliminate  Orange. 

In  contrast,  when  Orange  gains  the  initial  advan¬ 
tage,  he  completely  eliminates  Blue  in  less  than  120 
time  steps.  This  trajectory  (the  one  on  the  left)  was 
obtained  by  perturbing  the  equilibrium  solution  In  the 
opposite  direction  along  the  eigenvector  of  Xraax. 

Discussion  and  Recommendations 

The  many  Issues  which  remain  outstanding  divide  In¬ 
to  two  main  categories:  technical  and  operational. 
Ouestions  concerning  stability  are  prominent  among  the 


former,  with  proof  of  the  conjecture  that  kj,  k2  >  0 
implies  instability,  and  methods  for  estimating  the  dom¬ 
inant  eigenvalue  when  k^  and  k2  are  of  opposite 
signs,  both  having  high  Interest.  In  addition,  it  is 
important  to  extend  the  stability  analysis  to  Include 
the  case  where  forces  of  different  types  are  constrained 
separately . 

How  to  obtain  approximately  optimum  force  distri¬ 
butions  when  the  solution  of  the  linear  program  is  de¬ 
generate  also  deserves  careful  study;  and  a  great  deal 
of  analysis  plus  numerical  experimentatitn  should  be  de¬ 
voted  to  understanding  the  influence  of  the  various 
parameters  o.n  aggregate  parameters  such  as  M»,  ki, 
k2,  X/X  and  Y/Y. 

A  rich  variety  of  operational  questions  appear  to 
fall  within  the  purview  of  our  model.  By  way  of  il¬ 
lustration,  consider  the  scenario  of  Figure  2.  There 
are  three  battle  areas,  labelled  I,  II  and  III,  with 
three  different  types  of  Blue  forces  and  two  types  of 
Orange.  The  most  obvious  question,  how  should  the 
forces  be  distributed,  given  X  and  Y,  has  already  been 
discussed.  It  is  worth  note  in  passing,  however,  that 
objective  functions  of  the  for* 

M*C17‘C2Y 

may  be  particularly  relevant  to  military  operations;  the 
coefficients  Cj  and  C2  would  presumably  be  chosen  to 
reflect  a  conmander1 s  willingness  to  accept  losses,  with 
the  smaller  C-j  attaching  to  the  offensive  force. 

A  more  subtle  question  concerns  which  battle  areas 
to  defend  (or  attack).  Given  the  attrition  coefficients 
and  the  aggregate  force  levels,  an  unattractive  area 
will  be  assigned  zero  forces  by  the  optimization  rou¬ 
tine.  In  order  to  "shorten  the  lines,"  however,  we 
could  ask  by  how  much  would  the  attrition  coefficients 
need  to  be  changed  (say,  by  building  field  fortifi¬ 
cations)  before  the  arec  would  become  defensible.-  Or  In 
which  area  should  the  field  fortifications  be 
constructed? 

Additional  questions  address  the  force  level  neces¬ 
sary  to  achieve  the  military  objective.  For  example, 
how  large  must  Y  be  (for  given  X)  before  the  combat 
becomes  unstable  (i.e.,  before  a  breakthrough  can  be 
achieved?)  and  in  which  of  the  three  areas  will  the 
breakthrough  occur.  For  this  we  must  look  to  the  eigen¬ 
vector  corresponding  to  the  dominant  eigenvalue.  Con¬ 
versely,  for  the  defender,  how  large  must  X  be  in  order 
to  maintain  stability  and  prevent  a  breakthrouqh?  It 
seems  certain  that  the  modes  of  instability,  and  the 
terminal  states  to  which  they  lead,  will  be  of  far 
greater  operational  Importance  than  the  value  of  M* 
itself. 

To  answer  questions  such  as  those  above  In  a  mean¬ 
ingful  way  will  require  a  great  deal  of  additional  work. 
In  particular,  techniques  for  estimating  and  validating 
attrition  coefficients  appropriate  to  a  specified  style 
and  place  of  combat  need  to  be  developed.  The  concepts 
considered  here  may  provide  useful  guidance  in  how  to 
aggregate  detailed  coefficients  into  higher  level  combat 
models;  but  simulation,  controlled  experimentation  and 
the  analysis  of  real  historical  data  will  be  of  crucial 
Importance. 

Lastly,  the  theoretical  constructs  considered  thus 
far  have  been  essentially  static,  i.e.,  optimization  at 
one  point  in  time.  If  the  optimum  point  is  unstable, 
then  it  seems  unreasonable  to  believe  an  intelligent 
coirniander  will  not  attempt  to  redistribute  his  forces  in 
order  to  capitalize  on  an  incipient  breakthrough,  or  to 
try  and  reverse  one.  Certainly  neither  wi’l  continue  to 
replenish  forces  at  a  constant  rate,  as  assumed  in  our 
model,  but  will  comiit  reserves  in  bulk  at  an  opportune 
time.  When  to  reoptimize  as  the  )"'<-ce  levels  diverqe 
from  their  Initial  equilibrium  rema  ..s  open  to  further 
study.  Surely  one  important  question  is  whether 
another,  stable  equilibrium  point  lies  nearby  the 
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nstable  optimum  point,  as  In  the  example  presented  AeptNUix  a.  wmj 

oove;  or  whether.  If  left  unchecked,  a  catastrophe  Is  _  - _ 

n  the  making  for  the  side  which,  perhaps  by  chance, 

uffers  the  first  small  disadvantage  In  addition.  If  -Measure  of  Effectiveness 
he  force  levels  move  too  far  from  their  Initial  values, 

t  may  become  necessary  to  modify  the  system  equations  Consider  the  genera 

n  reflect  changes  In  the  style  and  locale  of  combat.  , 

Me  conclude  by  expressing  the  opinion  that  an  1m-  uj  *  x<  / 

ortant  potential  area  of  application  for  this  and  re-  j 

ated  work  lies  In  war  games  and  simulations  aimed  at 
valuatlng  and  Improving  conmand  cjiitrol  and  systems  and  f  *  1,  2,  ...,  I 

octrlne.  mu  l 


APPENDIX  A:  GAME  THEORETICAL  OPTIMIZATION  OF 
LANCHESTER  EQUATIONS _ 


Consider  the  general Ized  Lanchester  Equations 

Ji  •  -*i  ui  *  xi2a«  yj  •  Ebu  yj +  ri ; 
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FIGURE  2:  MULTIPLE  BATUEFIElOS 

Th«  notation  x,.wm  fore** 
of  typa  t  mijned  to  battle¬ 
field  i. 


yj  ■  -vj  yj  *  yj  2  cu  xi  •  2  dij  xi +  sj  ; 

i  i 

j  *  1,  2,  ...,  J 

where  the  xj  and  yj  represent  opposing  forces  of 
class  1  and  j  respectively,  and  the  other  parameters  are 
non-negative  constants.  Furthermore,  assume  that  there 
are  constraints 

2  xi  “ x*  Eyj  * Y*  xi  -  °*  yi  “  - 0 

<  j 

for  all  1,  j 

on  the  forces. 

Our  objective  Is  to  determine  advantageous  composi¬ 
tions  for  the  forces.  To  this  end,. It  Is  convenient  to 
define  a  measure  of  effectiveness,  M,  for  the  combat  by 

Hi  Efi<xVri>-E¥yVsj> 

i  i 

The  fj  and  gj  are  arbitrary  weighting  factors, 
which  for  notatlonal  simplicity  we  hereafter  assume  to 
be  unity.  Then 

M  *2xi(cu  ■  au)yj  +Exi(di  *  ui}  -E(bj  *  v*j 

U  f  J 

where  dift2d1j'  bJ  «  2b1j 

i  1 

This  can  be  rewritten  as 

M  -  xAy  -  by  +  id  (A.2) 


where  A  4  C  -  A 

b  j  b  -  •/ 

d  4  d  -  u 


an  I  x  J  rectangular  matrix 
a  J  element  row  vector 
an  I  element  col unn  vector 


and  where  x  and  y  are  ll -element  row  and  J-element  colunn 
vectors  respectively.  1  The  expression  for  M  can  be  writ¬ 
ten  more  homogenously  hy  defining  matrices 


0  4  i[d'!d,' ...] 


W. 


• _  •). 


,ii _ _ ; 

.  ',  j 


so  that 


»  *  V 

.  *  .s  ,* 


v.^.v.v.s-  , ,  f 


V 


H  *  xAy,  (A.3) 

with  “A*  $  X  -  IT  +  "S' 

Game  Formulation 

Me  can  Interpret  M  as  the  payoff  of  a  matr lx  game 
by  defining 

P  A  */*.  <1  £  y/Y,  X  4  XAV 

Then 

M  »  pXq 

from  which  It  follows  [3]  that  there  are  optimum 
probability  vectors  p*  and  q*  such  that,  for  all  x  and 

y. 

xAy*  <  M*  <  x*Ay  (A.4) 

where  M*  £  x#Ay* 


But  Eqs.  A. 5  and  A. 3  together  imply  that 

■  V  *  (*1  -B,  ♦°1>y*  ?1 

so  that,  for  all  1, 

V*  +  ^  •  (M*  by*)/X  $  kj  (B.l) 

Similarly,  for  all  j, 

x*Xj  -  bj  -  (M*  -  x*d)/Y  ft  -k2 
It  follows  that 

iX  -  kjSX  *  8Y  -  k2«Y  { B. 2 ) 

where  »X  4  ^j^ax^,  *Y  A  T!^] 

1  j 

The  parameters  ki  and  kj  serve  to  characterize  par¬ 
tially  the  dynamical  relationship  between  the  aggregated 
perturbations  iX  and  8Y  of  the  opposing  forces. 


x*  •  Xp*,  y*  ■  Yq*  Stab  11 Ity  Anal ys Is 

The  vectors  p*  and  q*  may  be  found  by  solving  (either  In  terms  of  the  perturbation  vectors  lx  and  ly,  Eq. 

of)  the  dual  linear  programs  a.1  can  be  bitten  as 


(P)  min  pc  (0)  max  bq 

S.t.  pX  2  b  S.t.  Xq  <  c 

p  2  0  q  2  0 

(where  b  and  c  are  row  and  column  vectors  of  1  * s,  having 
J  and  I  components  respectively)  and  normal izlng  the 
solutions  by  dividing  by  the  (cormon)  optimum  value  of 
the  objective  functions.  The  reciprocal  of  this  optimum 
value  Is  M*. 

A  result  we  shall  need  later__1nvolves  the  comple¬ 
mentary  slackness  conditions:  If  p,  q  solve  the  linear 
program,  then 

P1  >  0  -»  X(q  -  1  ;  qj  >  0  -.  p  AJ  «  1 

with:  the  1th  row  of  'a'  and  "XJ  the  Jth  column  of 

A* 

It  follows  that 

Xj*  >  0  -*^y*  »  M*/X 

(A. 5) 

yj*  >  0  -*  x^  -  M*/Y 

APPENDIX  3:  NEIGHBORHOOD  STABILI1  / 

The  dynamical  evolution  of  Eq.  A.l  Is  complicated 
by  the  non-linearity  of  the  equations.  Considerable  in¬ 
sight  may  still  be  gained,  however,  by  selecting  ( x* , 
y*)  as  the  operating  point  and  choosing  the  replenish¬ 
ment  rates  rj  and  si  to  make  all  time  deriva¬ 
tives  zero.  Than  (x*.  y*j  Is  an  equilibrium  point,  and 
an  Important  question  Is  whether  or  not  the  equilibrium 
Is  stable. 


qregate  Properties 


He  consider  small  perturbations  around  (x*,  y*)  by 
letting 

for  all  non-zero  components  of  x*,  y*.  Neglecting  2nd 
order  terms,  we  then  have  from  Eq.  A. 2 

2>1  -  5^Jyj  ”  «x$y*  +  2]  +  [x*A  -  b]ay 
1  j 


where  ^  »  A,y*  ♦  u,  'fry  -  xfa(j  ♦  by 

$  1  ■  cu*i  +  d1l  "  x*cJ  4  vj 

and  'a'y  *  "c^ j  *  0  for  al  1  j  f  1 

The  operating  point  (x*,  y*)  will  be  stable  Iff  all  ei¬ 
genvalues  of  the  "conflict  matrix" 


are  In  the  left  half  plane.  By  similarity,  an  equiva¬ 
lent  statement  Is  that  all  eigenvalues  of  the  matrix 

?  4  Ktr]  "•4I 

must  be  In  the  LHP.  „ 

Note  that  the  col  win  sums  on  the  left  side  of  C  are 
given  by 

E  'n  *  -(Aiy* +  “i5  +  E(cu>i 4  du)  •  ki 


and  for  col  wins  on  the  right  by 


IX  ■  -(x*cj  4  v 4  E(x*aij 4  bij)  -  k2 


He  make  two  Important  observations:  First,  all  of  the 
Gershgorln  disks  [4]  of  C  lie  totally  withl,-.  the  LHP 
whenever 

kj  <  0,  k2  <  0  (B. 5) 

which  is  therefore  a  sufficient  condition  for  vx*,  y*) 
to  be  a  stable  operating  point. 

Second,  If 
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kj  «  kj  *  k  >  0 


(8.6) 


then  k  Is  the  dominant  eigenvalue  of  C^and  the  system  Is 
unstable.  That  k  Is  an  eigenvalue  of  C  is  seen  from  the 
fact  that 

Oet  [c  -  kl]  «  0 

since  any  row  of  ~  -  kl  can  be  brought  to  all  zeros  by 
replacing  It  with  the  sums  of  all^he  rows.  Consider¬ 
ation  of  the  Gershgorln  disks  of  C,  all  of  which  pass 
through  k,  show  It  to  be  the  eigenvalue  with  largest 
real  part.  He  conjecture,  but  have  only  been  able  to 
prove  for  special  cases,  that  i  will  always  be  unstable 
when  ki  and  kj  are  both  positive.  The  stability  of 
(T  is  not  determinable  from  ki  and  k2  alone  when  they 
are  of  opposite  sign. 
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|  ABSTRACT 

V  ' 

An  overview  of  an  analytical  approach  to  tbs  aradalinf 
and  evaluation  of  inforaation  proceaaini  and 
dsciaionaakint  organizations  ia  praaantad.  Tba 

aatbaaatical  fraasvork  naad  in  modeling  tba 
individual  decialonaakara  aa  veil  aa  tbs  organization 
ia  that  of  n-diaanaional  inforaation  tbsory.  Tba  data 
flow  foraaliaa  ia  naad  to  aodal  in  a  praeiaa  Banner 
the  various  types  of  interactions  betvaan 

decisionaakers  as  veil  aa  interactions  betvaan  hnaana 
and  tbs  coaaand,  control,  and  ooaannifation  syatea 
that  supports  the  organization.  Coaparlson  and 

evaluation  of  alternative  organizational  foras  is 
accoavlished  by  considering  organizational 

parforaanca,  individual  vorkload,  and  tba  seta  of 
satisficing  decision  stratagias.  ^ 


1.  INTRODUCTION 

la  conaidaring  organizational  atrnctnraa  for 
taaas  of  dseiaionaakers,  a  dasignsr  aust  address  tba 
questions  of  vbo  racaivss  what  inforaation  and  who  it 
assigned  to  aaka  which  decisions.  Tbs  resolution  of 
thess  questirns  spaclfiaa  tba  organizational  fora. 
The  designer's  problaa  ia  the  selection  of  a  fora  so 
that  the  resulting  organization  neats  its  parforaanca 
specifications  and  tba  individual  acabara  are  not 
overloaded,  i.a,,  the  task  requireaents  do  not  ezcaad 
their  individual  proeassing  limitations. 

While  tba  role  of  the  huann  decisionmak  rs  is 
central  to  the  design  problaa,  the  latter  cannot  be 
decoupled  froa  the  consideration  of  tba  inforaation 
ayataa  that  supports  the  organization.  Consider,  for 
ezanple,  a  tactical  ailitary  organization  supported  by 
a  eoaaand,  control,  and  cosssunicat  ions  (C  j  syatea. 
Inforaation  ia  collected  froa  aany  sources, 
distributed  to  appropriate  unite  in  the  organization 
for  processing,  and  used  by  the  coaaanders  and  their 
staff  to  aaka  decisions.  These  decisions  are  then 
passed  to  the  units  responsible  for  carrying  thsa  out. 
That,  a  given  organization  design  iaplles  the 
ezisteace  of  a  C*  syatea  that  supports  it. 
Conversely,  the  presence  of  a  C*  syatea  in  support  of 
an  organization  aodifiss  the  letter’s  operational  it 
aay  create  operational  nodes  not  foreseen  during  the 
organizational  design  phase.  Therefore,  if  a 
quantitative  description  of  the  organization  design 
problaa  ia  to  be  developed,  it  aust  take  into  account 
not  only  the  organization  aenbers,  but  also  the 
collection  of  equipaent  and  procedures  that  constitute 
the  organization's  C’  syatea. 

In  order  to  develop  a  quantitative  nethodology 
for  the  analysis  and  evaluation  of  inforaation 
processing  and  declslonaaking  organizations,  it  is 
necessary  that  a  set  of  coapatible  aodels  be  obtained 
that  describe  the  organization  and  its  environaent. 
This  aodeling  effort  has  been  divide^  in  three  steps. 


The  first  one  ia  the  aodeling  of  the  tasks  the 
organli:ation  ia  to  ezeonte  and  the  definition  of  the 
boundary  between  the  organization  and  its  environaent. 
The  second  step  ia  the  selection  of  aatheaatieal 
aodels  that  describe  the  aesbers  of  the  organization. 
Ths  third  stap  ia  the  aodeling  of  organizational  font, 
i.e.,  the  specification  of  the  Inforaation  and 
decision  structures  that  characterize  the 
organization.  This  step  includes  the  specification  of 
the  protooola  for  inforaation  ezchauge  and  the 
aodeling  of  ths  conaun lest ion  systens,  the  data  bates, 
and  the  decision  aids  that  the  organization  uses  to 
perfora  its  tasks. 

The  aethodology  itself  consists  of  two  aain 
parts.  Ia  the  first  one,  the  analysis  of  ths 
organization,  the  swdela  are  used  to  describe  tht 
organization  in  terns  of  a  locus  defined  on  a 
generalized  performance  -  workload  apace.  This  locus 
ia  obtained  by  computing  aa  iadez  of  parforaanca  for 
the  organization  and  aeasures  of  the  vorkload  for  each 
individual  aeaber  of  the  organisation  at  functions  of 
the  adaisaible  decision  strategies  used  by  the 
deeiaionaakers.  The  second  pert  of  the  aethodology 
addresses  the  question  of  evaluating  organizational 
designs  and  comparing  alternative  structures. 

The  analytical  framework  need  for  aodeling  the 
tasks,  the  individual  organisation  acabara,  the  C* 
syatea,  and  the  organisation  aa  a  whole  ia  that  of  n- 
diaensional  inforaation  theory  [13].  A  brief 
description  of  the  koy  quantities  and  of  the  partition 
law  of  inforaation  [3]  is  presented  in  the  nest 
section. 

2.  INFORMATION  THEORETIC  FRAMEWORK 

Inforaation  theory  was  first  developed  as  an 
application  in  coaaunieation  theory  [13].  Dut,  as 
Khinehin  [9]  showed,  it  is  also  a  valid  aatheaatieal 
theory  in  its  own  right,  and  it  is  useful  for 
applications  ia  aany  disciplines,  including  the 
aodeling  of  siaple  buaan  decislonaaking  processes  [16] 
and  ths  analysis  of  information-processing  systeas. 

There  are  two  quantities  of  priaary  interest  in 
inforaation  theory.  The  first  of  these  ia  entropy: 
given  a  variable  z,  which  is  an  element  of  the 
alphabet  X,  and  occurs  with  probability  p(x),  the 
entropy  of  z,  H(z),  it  defined  to  be 


H(z)  ■  -  J  p(s)  log  p(z) 


and  is  aeasured  in  bits  when  the  base  of  the  algorithm 
is  two.  The  other  quantity  of  interest  is  average 
actual  •  inforaation  or  transmission:  given  two 
variables  z  and  y,  elements  of  the  alphabets  X  and  T, 
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*nd  given  p(x),  p!y),  and  p(xly)  (the  conditional 
probability  of  x,  |Uta  ths  vain*  of  y) ,  the 
transmission  between  x  ud  y,  T(x:y)  it  defiaod  to  bt 

T(x:y)  •  B(x)  -  By(x)  ( 2.2 > 

vhi  r« 


B^(x)  ■  -  J  p(y)  ^  p(xly)  log  p(xly)  (2.3) 

r  * 

it  tht  eoBditiOBtl  uncertainty  ia  tht  variable  x, 
pitta  fall  knowledge  of  tht  value  of  tht  Ttritblt  y. 

McGill  [13]  generalized  thii  bttio  two-variable 
input-output  theory  to  N  diatotioat  by  extending  Eq. 
(2.2): 

N 

T<v*» . a,,)  -2«<x4)  -■<*,.*, . v  «•■«> 

i-1 

For  tht  nodeling  of  aenory  tad  of  itqaeatial 
iupats  which  art  dependent  on  ttch  other,  the  att  of 
the  tatropy  rate,  H(x),  which  dtaeribaa  the  average 
catropy  of  x  per  wait  tiae,  it  appropriate: 

B(x)  ■  lin  i  B(x(t),  x(t+l),...,x(t+a-I)]  (2.3) 


Traaaaiatioa  rattt,  T(x:y),  art  dtfiatd  exactly  like 
traaaaiaaloa,  bat  aaiag  tatropy  ratea  ia  the 
definition  rather  thaa  eatropiee. 

The  Partitioa  Law  of  laforaatioa  [3]  ia  dtfiaed 
for  a  ayatea  with  N-l  iateraal  variablea,  w4  through 
,  aad  an  output  variable,  y,  elao  called  w^.  The 
law  atatea 


iapat  to  the  ayetea  that  ia  aot  lacladed  ia  the 
output.  The  third  tera,  T(w1:w1:...:w„_1:y)  ia  called 
coordlaatioa  aad  deaigaated  Ge>  It  la  the  N- 
diaeaaioaal  traaaaiaaioa  of  the  ayatea,  i.e.,  the 
aaoaat  by  which  all  of  the  iateraal  variablea  ia  ths 
ayatea  oouatraia  each  other.  The  laat  tera, 
BI(w1,wJ .... ,*N_,.y) >  deaigaated  by  Ga  repreaeata  the 
uncertainty  that  reaaiaa  in  the  ayatea  variablea  when 
the  input  it  coapletely  known.  Thi a  aoiae  thonld  not 
be  coaatraed  to  be  aeceaaarily  uadeeirable,  aa  it  ia 
in  coaaaaication  theory:  it  nay  alao  be  thought  of  aa 
internally-generated  information  anpplied  by  the 
ayatea  to  aappleaent  the  input  aad  facilitate  the 
deeiaionaaking  proceaa.  The  partitioa  law  nay  be 
abbreviated: 

0  “  0,  ♦  G.  ♦  0  ♦  G  (2.8) 

C  D  6  I 

i 

A  atateaeat  coapletely  analogoaa  to  (2.8)  can  be 
aade  about  laforaatioa  ratea  by  aubatitutiag  entropy 
rate  aad  traaaaiaaioa  ratea  ia  (2.6). 


3.  TASC  MODEL  [8.1b] 

The  organization,  perceived  aa  aa  opea  ayatea 
[10],  lateracta  with  ite  eaviroaaeati  it  receivea 
tignala  or  aaaaagea  la  varioua  forms  that  contain 
laforaatioa  relevaat  to  the  orgaaixatloa'e  tasks. 
These  aaaaagea  Bust  be  identified,  analyzed,  aad 
traaaaittad  to  their  appropriate  destinations  within 
the  organization.  Froa  this  perspective,  the 
crgaaixatloa  acta  aa  aa  laforaatioa  user. 

Let  the  organisation  rereive  data  froa  one  or 
aore  sources  external  to  it.  Every  vB  aaits  of  tiae 
oa  the  average,  eaoh  source  a  geaeratea  symbols, 
eigaals,  or  aesaages  xnt  froa  its ■ associated  alphabet 
XB,  with  probability  pal,  i.e., 

p»i-*(V‘.i,  *  *.i*x.  . 7» 

(3.1) 


N 

Y  ■(wi)  -  T(x:y)  ♦  Ty(x:w1 ,wj 
i-1 


'N-s 


) 


> 

2Pai’l  »  . «*  (3J> 

1-1 


♦  T(w  :w. 


N-s* 


♦ 


(2.6) 


aad  is  easily  derived  using  laforaatioa  theoretic 
identities.  The  left-hand  aide  of  (2.6)  refers  to  the 
total  activity  of  the  systea,  also  designated  by  G. 
Each  of  the  quantities  on  the  right-hand  side  has  its 
oea  interpretation.  The  first  tera,  T(x:y),  is  called 
throughput  and  is  designated  Gt.  It  aeasures  the 
aaoaat  by  which  the  output  of  the  systea  is  related  to 
the  input.  The  second  quantity. 


Ty(x:wt,wj,...,wfJ_1)  -  T(x:wi,w4,...,wa_1,y) 


-  T(x:y) 


(2.7) 


where  yB  is  the  diaensloa  of  xB.  Therefore.  l/*n  is 
the  naan  frequency  of  symbol  generation  froa  source  a. 

The  organization’s  tssk  is  defined  as  the 
processiag  of  the  input  symbols  xB  to  produce  output 
symbols.  This  definition  iaplies  that  the 
organization  designer  knows  a  priori  the  set  of 
desired  responses  I  sad,  furthermore,  has  a  function 
or  table  L(xs)  that  associates  a  desired  response  or  a 
set  of  desired  responses,  eleaents  of  I,  to  each  input 

*B  *  V 

It  la  assuaed  that  a  specific  eoaplex  task  that 
asst  be  performed  can  be  nodded  by  N’  sources  of 
data.  Bather  thaa  considering  these  sources 
separately,  one  supersource  composed  of  *  these  N’ 
sources  is  created.  The  input  symbol  x'  any  be 
represented  by  sa  N’-dlesnslonal  vector  with  each  of 
the  souroea  represented  by  a  component  of  this  vector, 
i.e.. 


is  called  blockage  and  is  designsted  G^.  Blockage  Bay 
be  thought  of  as  the  aaount  of  information  ia  the 
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V  *  i'*x 


(3.3) 


«'  -  (x^.x,. 


To  ditinlu  th*  probability  that  symbol  x ’  ia 
generated,  tha  independence  between  ooaponanta  a a it  ba 
eoaaidarad.  If  all  components  ara  aataally 


independent,  than  p.  la  tha  product  or  tha 
probabilitiaa  that  each  component  of  xj  taka*  oa  lta 
raapaotiva  value  froa  it*  a**ociat*d  alphabet: 


(3.4) 


If  two  or  aora  eoapoaant*  art  probabiliatieally 
dapaadaat  oa  aaoh  othar.  bat  aa  a  group  ara  aataally 
iadapaadaat  froa  all  othar  eoapoaant*  of  tha  inpnt 
▼actor,  than  that*  dapaadaat  eoapoaant*  eaa  b*  treated 
aa  oa*  *ap*reoapoa*at ,  with  a  aa*  alphabet.  Than  a 
aa*  iapat  vector,  x,  ia  defined,  composed  of  the 
aataally  iadapeadaaT  coaponeat*  and  the**  auper- 
eoapoaeata. 


Figare  1.  Iaforaatioa  Straetaraa  for  Organisations 
4.  MODEL  OF  TEE  ORGANIZATION  MEMBER  [2.3.11] 


This  nodal  of  the  soarea*  iapli**  synchronisation 
batwaaa  tha  generation  of  th*  individual  source 
elaaaata  so  that  they  may.  ia  fact,  ba  treated  aa  on* 
iapat  synbol.  Specif  leal  ty.  it  i*  assumed  that  th* 
a* an  iaterarrival  tin*  for  each  component  xB  ia  equal 
to  v.  It  ia  also  as  sued  that  th*  generation  of  a 
partiealar  iapat  vector.  Xj.  la  i’. dapaadaat  of  th* 
symbols  generated  prior  to  or  after  it. 

Th*  laat  assumption  eaa  b*  weakened,  if  th* 
aoare*  ia  a  discrete  atationary  argodie  on*  with 
eoaataat  iaterarrival  tin*  x  that  eoald  b* 
approximated  by  a  Markov  aoare*.  Than  th*  iafonatloa 
thaoratle  framework  eaa  be  retained  [8]. 

Th*  vector  oatpat  of  th*  aoare*  is  partitioned 
iato  groaps  of  componanta  that  are  aasignad  to 
different  organisation  members.  The  J-th  partition  ia 
denoted  by  *3  aad  is  derived  from  th*  corresponding 
partition  matrix  vhioh  has  dimansion  aj  x  N  aad 
rank  nj.  i.a.» 


Each  eolamn  of  has  at  most  on*  non-xaro  clamant. 
Th*  rasalting  vector s  r  may  have  soma,  all,  or  no 
components  in  eoasioa. 

Th*  sst  of  partitioning  matrices 
shown  in  Fignr*  1  specify  th*  components  of  the  input 
vector  raeeivsd  by  each  member  of  th*  subset  of 
decisionmakers  that  interact  directly  with  th* 
organisation's  environment.  Than*  assignments  ean  b* 
time  invariant  or  time  varying.  Ia  th*  latter  ease, 
th*  partition  matrix  eaa  be  expressed  as 


Th*  complete  realisation  of  th*  modal  of  th* 
decisionmaker  (DM)  who  1*  interacting  with  othar 
organisation  members  and  with  th*  environment  is  shown 
schematically  in  Figure  2. 


Figure  2.  Th*  imteraotiag  Decisionmaker  with  Memory 


Th*  DM  receives  singals  x  a  X  from  th* 
environment  with  iaterarrival  time  r.  A  string  of 
signals  may  be  stored  first  in  a  buffer  so  that  they 
ean  be  processed  together  ia  th*  situation  assessment 
(SA)  stag*.  Th*  SA  stag*  eontains  algorithms  that 
process  th*  incoming  signals  to  obtain  th*  assessed 
situation  i.  Th*  SA  stag*  may  aeeess  th*  memory  or 
internal  data  bat*  to  obtain*  set  of  values  d0.  The 
assessed  situation  s  may  be  shared  with  other 
organisation  member*}  concurrently,  th*  DM  nay  receive 
th*  supplementary  situation  assessment  s'  from  other 
parts  of  th*  organisation)  th*  two  acts  s  and  s’  are 
combined  in  th*  information  fusion  (IF)  processing 
stag*  to  obtain  i.  Son*  of  th*  data  (dj)  from  th*  IF 
process  nay  be  stored  ia  memory. 


for  t  *  T 
for  t  *  T 


(3.0 


Th*  times  at  which  a  decisionmaker  receives  inputs  for 
processing  can  be  obtained  either  through  a 
deterministic  (e.g.,  periodic)  or  a  stochastic  rule. 
The  question  of  how  to  select  th*  sat  of  partition 
matrices,  i.*.,  design  th*  information  structure 
between  th*  environment  and  th*  organisation,  has  been 
addressed  by  Stabile  [17,18]. 


The  possibility  of  receiving  coanands  from  other 
organisation  members  is  modeled  by  th*  variable  ▼*  and 
a  command  interpretation  (Cl)  stag*  of  processing  is 
necessary  to  combine  th*  situation  assessment  x  and  ▼* 
to  arrive  at  th*  choice  v  of  th*  appropriate  strategy 
to  us*  ia  th*  response  selection  (BS)  stag*.  The  KS 
stag*  contains  slgorithms  that  produce  output*  y  in 
response  to  th*  situation  assessment  x  and  th*  command 
input*.  Th*  RS  stag*  may  access  data  from  or  store 
data  in  memory  [7,8]. 
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A 


Figure  3.  Ditilltd  Model  of  the  Interacting 
Decisionmaker 

A  aore  detailed  description  of  th*  decisionmaker 
model  without  buffer  or  memory  Is  shown  In  Figure  3. 
This  figure  shows  ths  Internsl  structure  of  the  four 
prooessleg  stsges:  SA,  IF,  Cl,  end  SS.  The  situstlon 
Assessment  stsge  consists  of  s  set  of  0  Algorithms 
(deterministic  or  not)  thst  sre  cspsble  of  producing 
some  situstlon  sssessnent  i.  Ths  choice  of  slgorithas 
is  sehiewed  through  speclficstion  of  the  internsl 
wsrisble  n  In  uccordsnce  with  the  situation  sssessnent 
strstegj  p(u)  or  p(ulz),  if  a  decision  sld  (e.g.,  s 
preprocessor)  is  present.  A  second  Internsl  decision 
is  the  selection  of  the  Algorithm  in  the  RS  stsge 
According  to  the  response  selection  strategy 
p(vlx.v').  The  two  strategies ,  when  tsken  together, 
constitute  the  internsl  decision  strategy  of  the 
docisionnsker. 

The  analytical  framework  presented  in  Section  2, 
when  Applied  to  the  single  interacting  decislonaeker 
with  deterministic  Algorithms  in  the  SA  end  RS  stsges, 
yields  the  four  aggregate  qusntlties  thst  chsrscterize 
the  informstion  processing  end  decisionmaking  Activity 
within  the  DM  (2,111: 

Throughput: 

Gt  ■  T(*,s',v':*.y)  (4.1) 

Blockage. 

G.  -  B(x.x'.v’)  -  G  (4.2) 

D  C 

Imteramlfy  generated  imf  onset  ion: 

G  •  H(u)  -  H_(v)  (4.3) 

A  Z 

Coordination: 

D 

Go  ■  J  Pjg^tpCs))  ♦  ot  H(pt)  H(x) 
i-1 

♦  g*F(p(x,x’))  ♦  gf1(p(x,v') 

v 

♦  J  Pjg*(p(»lv  -  J))  ♦  Oj  B(Pj)  *  H(y) 

J-l 

♦  B(x)  ♦  H(x)  ♦  B(x.v)  ♦  T%(x’:x') 

♦  Ts(«'.**:v')  (4.4) 


The  expression  for  Gs  shows  that  it  depends  on 
the  two  internal  strategies  p(u)  end  p(vtz)  even 
though  a  comaand  input  nay  exiat.  This  implies  thst 
the  command  input  v*  modifies  the  DM’s  internsl 
decision  after  p(vlz)  has  been  determined. 

In  the  expresrions  defining  the  systea 
coordination,  p^  is  the  probability  that  algorithm  f^ 
has  been  selected  for  processing  the  input  x  end  p,  is 
the  probability  that  algorithm  h>  has  been  selected, 
i.e.,  u  “  1  and  v  »  J.  The  quantities  gc  represent 
the  internal  coordinations  of  the  corresponding 
algorithms  and  depend  on  the  distribution  of  their 
respective  inputs:  the  quantities  a,,  a.  are  the 
number  of  internal  variables  of  the  algorithms  fj  and 
h,,  respectively.  Finally,  the  quantity  B  is  the 
e&tropy  of  a  binary  random  variable: 

B(p)  «  -  plogj  p  -  (1  -  p)logj(l-p)  (4.3) 

Equations  (4.1)  to  (4.4)  determine  the  total  activity 
G  of  the  decisionmaker  according  to  the  partition  law 
of  information  (2.3).  The  activity  G  can  be 
evaluated  alternatively  as  the  sum  of  ths  marginal 
uncertainties  of  each  system  variable.  For  any  given 
internal  decision  strategy,  G  and  its  component  parts 
can  be  computed. 

Since  the  quantity  G  nay  be  interpreted  at  the 
total  information  processing  activity  of  the  systea, 
it  can  serve  as  a  neasnre  of  the  workload 
of  the  organization  nenber  in  carrying  out  hit 
decisionmaking  task. 

The  qualitative  notion  that  the  rationality  of  a 
human  decisionmaker  is  not  perfect,  but  is  bounded 
(12],  hat  been  modeled  as  a  constraint  on  the  total 
activity  G: 

G  •  G  +  G.  +  G  +  G  <Ft  (4.6) 

t  D  A  0  O 

where  tq  Is  the  tymbol  intersrrival  tine  and  F  is  the 
maximum  rate  of  information  processing  that 
characterizes  a  decisionmaker.  This  constraint 
implies  that  the  decisionmaker  most  process  his  input 
at  a  rate  that  is  least  equal  to  the  rate  with  which 
they  arrive.  For  a  detailed  discussion  of  this 
particular  model  of  bounded  rationality,  see  Boettcher 
and  Levis  [2] , 

Weakening  the  assumption  that  the  algorithms  are 
deterninlstie  changes  the  numerical  values  „f  and 
of  the  coordination  term  Ge  [4].  If  memory  is  present 
in  the  model,  then  additional  terms  appear  in  the 
expressions  for  the  coordination  rate  and  for  the 
internally  generated  information  rate  (7,8]. 

3.  ORGANIZATIONAL  FORM 

In  order  to  define  an  organizational  strncture, 
the  interactions  between  the  human  decisionmakers  that 
constitute  the  organization  must  be  specified.  The 
Interactions  between  DMs  and  the  environment  have 
already  been  described  in  Section  3.  The  internsl 
Interactions  between  Dlls  consist  of  receiving  inputs 
from  other  DM’s,  sharing  situation  assessments, 
receiving  command  inputs,  and  producing  outputs  that 
are  either  inputs  or  commands  to  other  DM’*.  The 
detailed  specification  of  the  interactions  requires 
the  determination  of  what  information  is  to  be  passed 
among  individual  organization  members  and  the  precise 
sequence  of  processing  events,  i.e.,  the  standard 
operating  procedure  or  communication  and  execution 
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-protocol  of  the  organixatlon. 

Information  atrnotnraa  that  can  be  moda lad  within 
thin  analytical  framework  arc  thoaa  that  repreaent 
ayuchronixed,  aeyelloal  information  flown.  Since 
iapnti  arc  attuned  to  arrive  at  a  fixed  average  rata, 
the  organixetion  ia  eooatrained  to  prodaca  ontpnta  at 
the  came  average  rate.  The  overall  reaponae  ia  made 
np,  in  general,  of  the  reaponaea  of  aeveral  nembera: 
therefore,  eaeh  member  ia  aaaemed  to  complete  the 
proeeaaing  eorreeponding  to  a  particular  input  at  the 


Within  thia  overal  rate  aynchronixatlon,  however, 
proeeaaing  of  a  apeeifie  input  aymbol  or  vector  taken 
plaee  in  an  aaynebronoua  manner.  If  the  reqolaite 
inputa  for  a  particular  ettgo  ft  proeeaaing  are 
preaent,  then  proeeaaing  can  begin  'ithout  regard  to 
any  other  atage,  which  impliea  tb  eoncurrent 
proeeaaing  ia  preaent.  For  example,  at  toon  an  the 
organixetion  input  arrivea  and  ia  partitioned  through 
a,  proeeaaing  of  x  begina  to  obtain  x.  The  IF  atage 
aroat  wait,  however,  until  both  the  x  and  x*  valnea  are 
preaent.  Each  atage  of  proeeaaing  la  thua  event- 
driveni  a  well-defined  aequenee  of  eventa  ia  therefore 
an  eaaential  element  of  the  model  apeeifieation. 

Aeyelieal  information  atructurea  are  thoae  whoae 
directed  grapha  repreaentlng  the  flown  of  information 
do  not  eontaln  any  eyelet  or  loopa.  Thia  rratrietlon 
ia  made  to  avoid  deadlock  and  circulation  of  meaaagea 
within  the  organixetion.  Deadloek  oceura  when  one  DM 
ia  waiting  for  a  meaaage  from  another  in  order  to 
proeeed  with  hia  tank,  while  the  aeeond  one  ia  in  turn 
waiting  for  an  input  from  the  firat. 

The  ayatem  theoretie  repraaentation  of  the 
organixatlonal  form  ia  uaeful  for  ahowing  the  varloua 
proeeaaing  atagea  or  aubayatema.  For  example,  in 
Figure  4,  a  two  paraon  organixetion  ia  ahown  in  block 
diagram  form  in  which  the  aeeond  member  aenda 
Information  to  the  firat  (x**),  who  in  turn  ean  iaatie 
coanande  to  the  aeeond  DM. 

Evaluation  of  the  varioue  information  theoretie 
quantitive,  including  total  activity,  ean  be 
aeeompliahed  readily,  uaing  the  decompoaition 
property  of  the  information  theoretie  framework  []j. 
However,  the  Internal  information  atructure  of  the 
organixetion  la  often  ambiguoua  when  repreaented  in 
block  diagram  terma.  For  example,  the  requirement 
that  both  x1  and  a**  be  preaent  before  IF1  proeeaaing 
ean  begin  ia  not  apparent  from  Figure  4.  An  alternate 
repreaentation  ia  needed  which  ahowa  explicitly  the 
information  atxueture  |  without  eompromiaing  the 
uaefulneaa  of  the  information  theoretie  deoompoaition 
property.  1 

The  data-flow  eehena  [1,6]  ha  a  been  developed  aa 
a  model  of  information  flow  for  ayatena  with 
aaynehronoua ,  eoncurrent  proeeaaing  aetivitiea.  Three 
baaie  elementa  are  need  tin  their  atructure:  plaeea, 
tranaitiona,  and  directed! area  which  connect  the  two. 
Plaeea  and  tranaitiona  <  repreaent  eonditiona  and 
eventa,  reapeetively.  No  event  occura  unleaa  the 
requiaite  eonditiona  are  mkt,  but  the  occurrence  of  aa 
event  givea  riae  to  new  eonditiona.  Tokena  are  uaed 
to  mark  which  eonditiona  are  in  effect:  when  all  input 
plaeea  to  (eonditiona  f oi )  a  tranaitlon  contain  a 
token  (are  aatiafied),  then  the  event  ean  occur,  which 
in  tarn  reaulta  in  the  generation  of  tokena  for  output 
plaeea.  Since  tokena  are  carrierr  of  data,  each 
tranaitlon  it  a  proeeaaor  which  generatea  a  reault 
from  the  input  data  and  depoaita  it  on  an  output  token 
which  then  movea  aecordlng  to  the  eehena’*  atrueture 
along  a  directed  are  to  the  next  atago  of  proeeaaing. 


Figure  4.  Block  diagram  repreaentation  of  two  peraon 
organixatlon 


To  repreaent  the  Information  theoretic 
deeiaionmaking  model  uaing  a  data-flow  formalian,  a 
wimple  tranalation  in  atrueture  ia  made:  diatinct 
inputa  and  outpeta  of  eaoh  aubayatem  are  aaaigned 
placet  and  the  proeeaaing  within  a  aubayatem  ia 
repreaented  by  a  tranaitlon.  Acaoeiated  with  eaeh 
tranaitlon  la  the  aet  of  internal  variablea  of  the 
aubayatem,  exclu.  >ve  of  the  input  variablea,  which  are 
accounted  for  aeparatoly  by  the  input  plaeea.  By 
attuning  a  probability  dlatribution  on  the 
organixation’a  inputa,  dletributiona  are  alao  included 
on  the  plaeea  ia  the  atrueture.  Therefore, 
dletributiona  are  alao  preaent  on  aubayatem  variablea, 
and  all  information  theoretie  quaatitlea  are  well- 
defined  and  ean  be  eoaputed  aa  before. 

The  organixatlon  atrueture  ahown  la  Figure  4  can 
be  repreaented  ia  data-flow  terma,  aa  ahowa  in  Figure 
J.  Ia  addition  to  plaeea,  tranaitiona,  and  directed 
area,  the  atrueture  contaiaa  two  new  elementa,  the 
awitehea  u*  and  v*.  Theee  are  logieal  elementa  which 
direet  the  flow  of  tokena.  The  twitch  u*  takea 
valuea  Independently,  while  the  value  of  v*  ia 
determined  aa  a  reault  of  the  proeeaaing  by  algorithm 
B*  contained  in  Cl*.  Since  the  atrueture  ahown  in 
Figure  S  ia  equivalent  to  the  ayatem  theoretic 
atrueture  ia  Figure  4,  the  interna*.  variabte 
definition  and  all  information  theoretie  quantitiea 
remain  unchanged.  However,  the  information  etruetcre 
of  the  organixetion  ia  made  explieit  in  Figure  S. 

Onee  an  input  Z  ia  partitioned,  the  proeeaaing  by  each 
DM  ia  hia  reapeotive  SA  atage  (algorithma  f)  begina 
concurrently  and  aayaehronoualy.  The  information 
fuaion  proeeaaing  (algorithm  A*)  muat  wait  until  both 
x*  and  x**  have  arrived  at  the  input  placet  of  IF*. 
Similarly,  DM*  muat  wait  until  DM*  iaauea  a  command 
input  v*  before  the  jroeeta  of  command  interpretation 
ean  begin.  Thia  aequenee  of  proeeaaing  ia  evident 
from  the  repreaentation.  Note  that  becauae  of  the 
attuned  aynehronixation  with  reapect  to  organixatlon 
Inputa,  there  ean  be  at  moat  one  data  token  in  any 
tingle  place.  The  atrueture  ia  obvioualy  aeyelieal 
and  deadloek  in  the  organixetion  ia  prevented. 
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Figure  5.  Data-flow  representation  of  orguniiutioa 
structure. 


While  tho  data-flow  frnnotl  provides  u 
•quiwnloat  representation  for  the  date  of 
synchronous ,  aoyelieal  inforaation  strnetnre  ,  it  ia 
alto  able  to  aodel  store  general  structures,  nany  of 
which  are  of  interest  in  the  context  of  organisations. 
For  exaaiple,  the  frsneworh  can  easily  nodal  the 
eyclle  structures  whieh  arise  when  a  two-way  exchange 
of  inforaation  ia  present  in  an  organisation.  Suoh 
protocols  are,  of  coarse,  coaaon.  In  addition,  fully 
asynchronous  structures  can  be  represented  within  the 
fraaework.  Since  in  a  large  organisation  aeabera  do 
not  operate  at  the  sane  rate  (sane  teapo), 
asynchronous  processing  is  of  auch  interest.  The 
study  of  these  structures  and  their  laplications  in 
teras  of  the  n-diaensional  inforaation  theoretic 
fraaework  are  subjects  of  current  research. 


A  second  advantage  of  the  data  flow  fraaework  is 
that  it  provides  a  natural  way  for  describing  in  a 
precise  aanaer  the  interactions  between  the  M's  sad 
the  data  bases  and  decision  aids  present  in  the 
organisation. 


The  presence  of  data  bases,  an  integral  part  of  a 
C*  ayatea,  requires  the  introduction  of  two  additional 
node  ling  elcaents.  The  first  is  the  query-response 
process.  The  second  is  the  aodeliag  of  the  data 
storage  devices  theaselves.  Consider,  for  exaaple, 
the  situation  asaeasaent  subsystea  shown  is  Figure  6, 
An  accordance  with  the  internal  strategy  a,  an 
algoritha  is  chosen  to  prooess  the  inpnt  s.  However, 
this  algoritha  aay  require  paraneters  (e.g.,  terrain 
infoiaation,  aeteorologioal  data)  or  past  situation 
aasessaents  ia  order  to  do  the  processing.  The  data 
base  is  accessed  and  queried  for  this  inforaation 
through  the  signal  D. .  The  data  froa  the  data  bast 
are  provided  to  the  SA  subsystea  of  the  DM  through  Do. 
The  seas  link.  DI,  can  be  used  to  update  the 
automation  in  the  data  base.  Clearly,  the  Mock 
diagara  representation  it  aabiguoutt  the  data 
flow  forsallaa  allows  for  the  precise  aodeling  of  the 
fact  that  data  is  requested  only  when  certria 
conditions  are  net. 


Figure  6.  Model  of  SA  subsystea  with  data  base  access 


Consider  next  the  effect  of  a  data  base 
containing  data  that  do  not  change  during  the 
execution  of  a  task,  i.e.,  the  data  are  fixed.  At 
first  glance,  it  night  saea  that  the  addition  of  the 
data  base  with  fixed  values  would  have  no  effect  on 
the  total  inforaation  theoretic  rate  of  activity  of 
the  systea,  i.e., 

I(dt)  -  o  i  -  i.a . m  (j.D 

However,  the  problean  is  wore  coaplex.  For  exaaple, 
if  each  algoritha  f ^  accesses  paraaeter  values  froa 
the  data  bate  (in  contrast  to  having  theca  values 
fixed  within  the  algoritha  itself)  then  the  rates  of 
throughput,  blockage,  and  noise  of  the  contained  systea 
will  not  be  affected,  but  the  coordination  tarn  will 
have  additional  activity  rate: 

H 

“o  *  5  Pi  <5.J> 

i-1 

Since  a  data  base  increases  the  overall  activity 
of  the  systea  without  creating  any  change  in  its 
input-output  characteristic,  one  would  queetion  ite 
presence.  There  are  several  advantages:  (a)  reduction 
ia  the  inforaation  that  needs  to  exist  within  the 
slgorithas  or  withiu  the  deeisionaaker  aodel,  (b) 
increased  flexibility  ia  the  use  of  slgorithas  and 
hence  possible  reduction  in  the  nuaber  of  slgorithas, 
and  (c)  access  to  coaaon  data  by  several  organisation 
aeabers.  Even  though  there  is  increased  coordination 
activity  due  to  the  interaction  between  the  DM  and 
the  data  base,  the  total  activity  of  the  DM  nay  be 
reduced  —  the  task  aay  be  redesigned  to  fall  within 
the  bounded  rationality  constraints. 

Siailar  arguments  apply  to  the  aodeling  and 
analysis  of  decision  aids.  Prelialnsry  results 
indicate  that  an  inappropriately  designed  decision  aid 
aay  not  reduce  a  deeisionaaker’ s  inforaation 
processing  load,  but  aay  actually  increase  it  [4]. 

Ia  this  section,  an  approach  to  aodeling  the 
organisational  fore  —  the  specification  of  the 
protocols  for  interaction  between  DM’ s  —  and  tbe 
supporting  eoansnd,  control,  and  comuaication  systea 
has  been  presented.  It  is  based  on  an  integration  of 
the  data  flow  foraelisa  with  the  inforaation  theoretic 
fraaework  used  in  the  quantitative  aodeling  of  the 
decisionaaking  process. 


6.  ANALYSIS  OF  ORGANIZATIONS 


(bn  the  weighting  coefficients  in  probiMllties 


As  stated  im  Section  3.  it  ia  aasumed  that  tha  Corresponding  to  aaeh  Dip^)  it  a  point  in  tha 

designer  knova  a  priori  tha  aat  of  desired  responses  T  ainplax 
to  tha  inpnt  aat  X.  Than  tha  parfornanaa  of  tha 
organisation  in  accomplishing  ita  tatka  aan  be 
evaluated  aa ing  tha  approach  shown  in  Fignra  7.  - 

2  Fk  -  1.  »k  1  o  y  k  <«.j) 

k-1 


Figaro  7.  Farfornanoa  evaluation  of  an  organixation 


Tha  poaaibla  atrataglaa  for  an  individual  EH  ara 
alananta  of  a  eloaad  convex  hyperpolyhedron  of 
diaanaion  n-1  vhoaa  vartleaa  ara  tha  onit  vaetora 
corresponding  to  pare  atrataglaa. 

Bacanaa  of  tha  poaaibla  interaotioaa  anong 
organixation  aenbers.  tha  value  of  6  dapanda  not  only 
on  D(pv)  but  alao  on  tha  internal  daeiaiona  of  tha 
other  deoiaionaakers.  A  pure  organixational  strategy 
ia  defined  aa  a  M- tuple t  of  pure  atrataglaa,  one  froa 
each  DM: 


...n 


{Dk  •  Dk 
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Tha  organixation' a  actual  rasponaa  jr  can  be  eonpared 
to  tha  desired  set  T.  and  a  coat  aaalgnad  using  a  cost 
function  d(y,Y).  ua  axpactad  value  of  tha  coat, 
obtained  bp  averaging  over  all  poaaibla  inputs,  can 
serve  aa  a  performance  index,  J,  for  tha  organixation. 
For  axaaple,  if  tha  function  d(y,Y)  takas  the  valua  oi 
xaro  when  tha  actual  rasponaa  ia  one  of  the  daairad 
ones  and  unity  otherwise,  than 


Independent  internal  decision  atrataglaa  for  aaeh  DM, 
vhethar  pure  or  nixed,  induce  a  behavioral  strategy 
[14]  for  tha  organixation,  uhieh  can  be  axpraaaad  aa 


J  -  B  (d(y,T))  -  p(y  *  1 J  <«.1> 

Ia  this  case,  I  represents  tha  probability  of  tha 
organixation  aukiag  tha  wrong  decision,  i.a.,  tha 
probability  of  e.ror.  Once  tha  organisational  fora  ia 
apacifiad,  tha  total  processing  activity  0  and  tha 
valna  of  organisational  performance  J  can  be  axpraaaad 
as  functions  of  tha  internal  decision  strategies 
aalactad  by  each  decisionmaker.  Let  aa  internal 
strategy  for  a  given  das is ionmakar  ba  defined  as  para, 
if  both  tha  situation  aaaaaamant  strategy  p(u)  and  tha 
response  selection  strategy  p(vlx)  ara  pure,  i.a,,  aa 
algorithm  f(  it  selected  with  probability  ona  and  an 
algorithm  h,  ia  aalactad  also  with  probability  one 
whan  the  situation  aasassed  as  being  8: 

Dk  -  (p(u-i)  -  1  i  p(vjlx-2)  -  1)  (6.3) 

for  soma  i,  sons  j,  and  for  aaeh  *  clamant  of  tha 
alphabet  Z.  Thera  ara  a  possible  pure  internal 
atrataglaa, 

a  -  D-V*  (6.3) 

whara  V  is  tha  number  of  f  algorithms  In  tha  SA  stage, 
T  tha  number  of  h  algorithm  in  tha  RS  stage  and  M  tha 
dimension  of  tha  sat  Z.  All  other  internal  atrataglaa 
are  mixed  (14]  and  ara  obtained  as  convex  combinations 
of  para  strategics: 


■  5  pk»k  <«.«> 
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whara  pfc.  is  tha  probability  of  using  pure  strategy. 
D^..  Because  aaeh  DM  is  assumed  to  select  his 
strategy  indapandantly  of  other  DM's,  tha  strategy 
space  of  tha  organisation,  S°,  ia  determined  aa  tha 
direct  com  of  tha  individual  DM  strategy  spaces: 

S°  -  S*  0  S*0  ...  0  s"  (6.1) 

where  S*  denotes  tha  individual  DM  strategy  spaoa. 
Tha  dimension  of  S°  is  given  by 

M 

,  -  dim  S°  -  ^ 

1-1 

Thus,  tha  organixational  strategies  ara  elements  of  an 
s-dimcnsional  elossd  convex  hyperpolyhsdroa. 

At  A  ranges  over  S°,  tha  corresponding  valnas  of 
tha  performance  index  J  and  the  activity  or  workload 
of  aaeh  individual  organisation  member  can  ba 
computed.  In  this  manner,  tha  sat  3°  is  mapped  into  a 
locus  on  the  Mel  dimensional  performance-workload 
space,  namely  tha  space  Note  that 

only  tha  internal  processing  activity  of  tha 
decisionmakers  is  presented  la  tha  locus  and  not  the 
total  activity  of  tha  system  which  includes  tha 
activity  of  tha  decision  aids,  data  bases,  and  other 
components  of  tha  supporting  C*  system.  Consequently, 
tha  bounded  rationality  constraints  become  hyperplanes 
in  the  performance-workload  space.  Sinoa  tha  bounded 
rationality  constraint  for  all  DM’s  depends  on  T,  tha 
admissible  internal  decision  strategies  of  each  DM 
will  als-  depend  on  tha  tempo  of  operations.  The 
unconstrained  caae  can  be  thought  of  as  the  limiting 
cast  when  v 
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The  Mtkodoloir  tor  the  lulrilt  of 
orgamlsat tonal  structures  illoti  for  the  formulation 
tad  solution  of  too  probloat:  (a)  tht  determination 
of  tha  organisational  atratt|laa  that  ainlatxe  J  and 
(b)  tha  determination  of  tha  aat  of  atratagiea  for 
which  J  1  J.  Tha  first  problaa  la  oat  of  optialxatloa 
while  tha  lattar  la  foraalatad  ao  aa  to  obtala 
satlafleia|  strategies  with  rsspast  to  a  parforaaaca 
thraahold  J.  Tha  satisficing  condition  alto  daflnat  a 
plant  la  tha  p*rforaane*-workload  apaca  that  la  noraal 
to  tha  J  axis  and  Intaraacta  It  at  J.  All  points  on 
tha  loona  on  or  halo*  this  plant  ahleh  alto  satisfy 
tha  bonadad  rationality  constraint  for  aach 
daalslonaakar  la  tha  organisation  daflaa  tha  aat  of 
satlaflclng  daclslon  atrataglaa.  Analytical 
properties  of  this  locos  aa  ncll  aa  a  computational 
approach  to  Its  officiant  eoastrnatlon  have  bam 
dlaenaaad  In  [2,3,11]. 

A  qualitative  evaluation  of  an  organisational 
structure  can  be  made  by  eoaiparing  tha  parforaaaea- 
uorkload  loans  to  tha  apace  defined  by  the  satisficing 
and  bounded  rationality  conatrainta.  In  tha  tana 
aaaner,  alternative  organisational  atrntnraa  can  ba 
compared  by  conaldaring  their  respective  loci. 

Since  individual  decisionaakers  select  their  on 
daclaioa  atrataglaa  independently  of  all  other 
organisation  aaabara,  a  particnlar  organisational  fora 
can  yield  a  broad  range  of  perforaanea  aa  illustrated 
by  tha  loent  In  the  perforuence-vorkload  apaca.  The 
designer  auat  aasaaa,  tharafora,  tha  likelihood  that 
atrataglaa  which  lead  to  aatiaf icing  perforaanea  *111 
ba  selected.  A  poaaible  aaaanra  of  this  mutual 
eoasisteaey  between  Individually  selected  atrataglaa 
can  ba  obtained  by  coaparing  the  loans  of  tha 
aatlaflelag  atrataglaa  to  tha  loena  of  tha 
organisation's  strategy  space  S°.  Let  I1  ba  tha 
aubapacas  of  organisation  atrataglaa  which  are 
feasible  with  raapaet  to  tha  bounded  rationality 
constraint  of  each  DM,  i.e., 

I*  -  (A  I  01(A)  <  Fl  w)  (6.9) 

and  let  **  contain  tha  atrataglaa  that  satisfy  the 
perforaanea  threshold  J: 

V?  -  (A  I  J (A)  1  J]  (6.10) 

Tha  aubapaca  of  aatiaf icing  atrataglaa  K°  la  gives  by: 

t°  -  »*n...n*l,n  **  (6.11) 


The  volnaia  of  1®,  denoted  by  V(R°)  is  compared  with 
that  of  S®,  V(S°),  to  deteraine  tha  aeasnre  of  untnal 
cosaiataney,  Q,  i.e.. 


0  -  V(*°)/V(S®)  (6.12) 

Tha  ratio  Q  is  a  aonotonie  function  of  J  cad  t 
with  alniaua  saro  and  anxiaua  one.  A  null  value  for  Q 
iapliaa  that  ao  coabiaatlon  of  atrataglaa  of  tha 
individual  daeiaionaakara  will  satisfy  the  design 
specifications,  whila  unity  iapliaa  that  all 
organisational  atrataglaa  are  feasible,  i.e.,  satisfy 
tha  bounded  rationality  conatrainta  and  tha 
perforaanea  apsaif last  ions. 

Since  Q  can  ba  sxprsasad  aa  a  function  of  J  and  t 
only,  it  can  ba  plotted  in  the  three— diaenaional  apaca 
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(Q.J.v).  A  typical  plot  froa  a  three  DM  axaapla  [3] 
la  shown  in  Figure  8. 


J 


8.  Mu tut 1  eonalatancy  neacure  Q  varans  J  and  T. 


7.  CONCLUSIONS 

Aa  analytical  approach  to  oodallng  organisational 
atrnctnrsa  for  tsans  of  daeiaionaakara  supported  by 
eonaand,  control,  and  conannication  (C*)  syatena  has 
bean  described.  Tha  integration  of  n-diaensional 
inforaation  theory  with  the  data  flow  acheaa  provides 
tools  for  describing  tha  activities  and  Intaraetiona 
within  each  daeisionaaker  aodal,  aaong  daeiaionaakara, 
and  batwaan  a  dscisionnaker  and  tha  anpporting  C* 
syatsa.  Thila  only  synchronous  procaaaing  with 
aeyclical  Inforaation  structures  has  been  eonaidsrsd 
in  detail,  tha  approaoh  shows  proala*  for  tha  aodal ing 
and  analysis  of  asynchronous  Inforaation  procaaaing 
and  daelaionaaking.  Furthermore ,  tha  introduction  of 
aaaory  in  the  daeisionaaker  aodal,  and  data  base* 
tha  organisational  structure  hra  broadened  the  r 
of  organisation*  and  task*  that  can  ba  analyse*  sing 
thia  approach. 
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-f!  .  T  Abstract  Processor  network  environments  for  which  weakly 

■  '  ’’  1  f  ■'  c  '*•  coordinated  distributed  computation  seems  particularly 

-We  present'll  algorithmic  model  for  distributed  advantageous  typically  possess  one  or  more  of  the  fol- 

omputation  of  fixed  points  whereby  several  processors  lowing  characteristics  all  of  which  involve  occurance 

areicipate  simultaneously  in  the  calculations  while  of  some  type  of  unpredictable  event, 

xchanging  information  via  communication  links.  Vie-  Mi 

lace ^essentially  no  assumptions  on  the  ordering  of  I)  Computation  nodes  and  communication  links  are 

omputation  and  communication  between  processors  there-  subject  to  frequent  and/or  unexpected  failuies.  (For 

y  allowing  for  completely  uncoordinated  execution.  example  packet  radio  networks), 

ttid  that  even  under  these  potentially  chaotic  circum- 

tances  it  is  possible  to  solve  several  important  2)  Computation  nodes  have  diff 

lasses  of  problems  including  the  calculation  of  fixed  ing  speeds  of  execution.  (For 

oints  of  contraction  and  monotone  mappings  arising  in  is  assigned  to  a  perhaps  time  v 

inear  and  nonlinear  systems  of  equations,  shortest  involving  computation  loads  whi 

ath  problems,  and  dynamic  programming.  ^  priori). 

1.  Introduction  '  "  '  3)  Computation  at  various  node 


2)  Computation  nodes  have  different  and/or  time  vary¬ 
ing  speeds  of  execution.  (For  example  each  processor 
is  assigned  to  a  perhaps  time  varying  number  of  tasks 
involving  computation  loads  which  are  not  fixed  a 
priori). 


There  is  presently  a  great  deal  of  interest  in 
istributed  implementations  of  various  iterative  algo- 
ithms  whereby  the  computational  load  is  shared  by 
everal  processors  while  coordination  is  maintained  by 
nformation  exchange  via  communication  links.  In  most 
■f  the  work  done  in  this  area  the  starting  point  is 
ome  iterative  algorithm  which  is  guaranteed  to  con- 
•erge  to  the  correct  solution  under  the  usual  circum- 
tances  of  centralized  computation  in  a  single  proces- 
or.  The  computational  load  of  the  typical  iteration 
s  then  divided  in  some  way  between  the  available  pro- 
:essors,  and  it  is  assumed  that  the  processors  exchange 
ill  necessary  information  regarding  the  outcomes  of  the 
:urrent  iteration  before  a  new  iteration  can  begin. 

The  mode  of  operation  described  above  may  be  term¬ 
ed  synchronous  in  the  sense  that  each  processor  must 
:omplete  its  assigned  portion  of  an  iteration  and  com- 
mnicate  the  results  to  every  other  processor  before 
i  new  iteration  can  begin.  This  assumption  certainly 
snhances  the  orderly  operation  of  the  algorithm  and 
jreatly  simplifies  the  convergence  analysis.  On  the 
Jther  hand  synchronous  distributed  algorithms  also  have 
tome  obvious  implementation  disadvantages  such  as  the 
teed  for  an  algorithm  initiation  and  iteration  synchro- 
lization  protocol .  Furthermore  the  speed  of  computa¬ 
tion  is  limited  to  that  of  the  slowest  processor.  It 
is  thus  interesting  to  consider  algorithms  that  can 
tolerate  a  more  flexible  ordering  of  computation  and 
totnnunication  between  processors.  Such  algorithms  have 
so  far  found  applications  in  computer  communication 
networks  like  the  ARPANET  (1)  where  processor  failures 
are  common  and  it  is  quite  complicated  to  maintain 
synchronization  between  the  nodes  of  the  entire  network 
as  they  execute  real-time  network  functions  such  as 
the  r-uting  algorithm. 


•This  research  was  conducted  at  the  M.I.T.  Laboratory 
for  Information  and  Decision  Systems  with  partial  sup¬ 
port  provided  by  the  Defense  Advanced  Projects  Agency 
under  Contract  No.  ONR-N00014-7S-C-1183. 


3)  Computation  at  various  nodes  is  event  driven. 

(For  example  in  data  collection  or  sensor  networks 
where  the  timing,  and  ordering  of  measurements  may  not 
be  predictable.). 

It  is  possible  to  consider  various  degrees  of  co¬ 
ordination  in  different  types  of  distributed  algorithms. 
An  interesting  question  is  to  determine  the  minimum 
degree  of  coordination  needed  in  a  given  algorithm  in 
order  to  obtain  the  correct  solution.  To  this  end  we 
consider  an  extreme  model  ot  uncoordinated  distributed 
algorithms  whereby  computation  and  communication  are 
performed  at  each  processor  completely  independently 
of  the  progress  in  other  processors.  It  is  perhaps 
surprising  that  even  under  these  chaotic  circumstances 
it  is  still  possible  to  solve  correctly  importment 
classes  of  fixed  point  problems.  The  complete  analy¬ 
sis  is  given  in  [2]  for  broad  classes  of  dynamic  pro¬ 
gramming  and  in  [3]  for  more  general  fixed  point  pro¬ 
blems  involving  contraction  and  monotonicity  assump¬ 
tions.  Further  related  work  is  [5]  and  [6]. 

2.  A  Model  For  Distributed  Uncoordinated  Fixed 
Point  Algorithms 

The  fixed  point  problem  considered  in  this  paper 
is  defined  in  terms  of  a  set  X,  a  class  F  of  functions 
mapping  into  the  extended  real  line  [-<*>,♦<»},  and  a 
mapping  T  which  maps  F  into  itself.  Ne  wish  to  find 
an  element  J*  of  F  such  that 


or  equivalently 


T(J»)(x), 


V  xeX, 


where  J* (x)  and  T(J*)(x)  denote  the  values  of  the  func¬ 
tions  J*  and  T(J*)  respectively  at  the  typical  element 
xeX.  We  will  assume  throughout  that  f  has  a  unique 
fixed  point  J*  within  the  set  F. 


We  provide  some  examples: 


Example  1:  (Fixed  points  of  mappings  on  Rn) .  Let  X 
be  the  finite  set 


X  =  (1,2 . n}. 

and  F  be  the  set  of  all  real-valued  functions  on  X. 

Then  F  can  be  identified  with  the  n-dimensional  space  Rn 
in  the  sense  that  with  each  JeF  we  can  associate  the  n- 
dimensional  vector  J(l),  J(2) , . . . ,J(n) .  Similarly  T(J) 
can  be  identified  witn  the  n-dimensional  vector  T(J)(1), 
...,T(J)(n),  so  the  fixed  point  problem  (1)  amounts  to 
solving  the  system  of  n  equations 


J*  =  T(J*)  or  J*(i)  =  T(J*)(i),  V  i  =  i;...,n 

(3) 

with  the  n  unknowns  J*(l) , . . . ,J*(n) .  It  is  also  evident 
that  any  system  of  n  (possibly  nonlinear)  equations 
with  n  unknowns  can  be  formulated  into  a  fixed  point 
problem  such  as  (3) . 

Example  2:  (Shortest  path  problems).  Let  (W , L)  be  a 
directed  graph  where  N  =  (1,2, ...,n)  denotes  the  set 
of  nodes  and  L  denotes  the  set  of  links.  Let  N(i) 
denote  the  downstream  neighbors  of  node  i,  i.e.,  the 
set  of  nodes  j  for  which  (i,j)  is  a  link.  Assume  that 
each  link  (i,j)  is  assigned  a  positive  scalar  a., 
referred  to  as  its  length.  Assume  also  that  13  there 
is  a  directed  path  to  node  1  from  every  other  node. 

Then  it  is  known  ([4],  p.  67)  that  the  shortest  path 
distances  J*(i)  to  node  from  all  other  nodes  i  solve 
uniquely  the  equations. 

J*(i)  =  min  {a..  ♦  J*(j)}  ,  i  /  1  (4a) 

jeti(i)  3 

J*(l)  =  0  (4b) 


If  we  make  the  identifications  X  =  {l,2,...,n},  F:  Set 
of  all  functions  mapping  X  into  [0,-k»],  and  define 
T(J)  for  all  JeF  by  means  of 


T(J)(i) 


min  {a..  ♦  J(j)}  if  i  /  l  (5) 

jcN (i)  13 

0  if  i  =  1 


then  we  find  that  the  fixed  point  problem  (2)  reduces 
to  the  shortest  path  problem. 


obtained  from  the  latest  computation  to  one  or  more 
nodes  j  (j/  i) .  In  the  idle  state  node  i  docs  nothing 
related  to  the  solution  of  the  problem.  It  is  assumed 
that  a  node  can  receive  a  transmission  from  other 
nodes  simultaneously  with  computing  or  transmitting. 


We  assume  that  computation  and  transmission  for  each 
node  takes  place  in  uninterupted  time  intervals  [t,,t,] 
with  tj  <  t2>  but  do  not  exclude  the  possibility 

that  a  node  may  be  simultaneously  transmitting  to  more 
than  one  nodes  nor  do  we  assume  that  the  transmission 
intervals  to  these  nodes  have  the  same  origin  and/or 
termination.  We  also  make  no  assumptions  on  the 
length,  timing  and  sequencing  of  computation  and  trans¬ 
mission  intervals  other  than  the  following: 

Assumption  (A):  There  exists  a  positive  scalar  P  such 
that,  for  every  node  i,  every  time  interval  of  length 
P  contains  at  least  one  computation  interval  for  i  and 
at  least  one  transmission  interval  from  i  to  each  node 

j  i  i. 


Each  node  i  also  has  a  buffer  B^  for  each  j  t  i 

where  it  stores  the  latest  transmission  trom  j ,  as 

well  as  a  buffer  B. .  where  it  stores  its  own  estimate 
n 

of  values  of  the  solution  function  for  all  xeX. .  The 

1  t 

contents  of  each  buffer  B. .  at  time  t  are  denoted  J... 
t  l) 

Thus  J. .  is,  for  every  t,  a  function  from  X.  into 
ij  J 

and  may  be  viewed  as  the  estimate  by  node  i  of 
the  restriction  of  the  solution  function  J*  on  X^ 

available  at  time  t.  The  rules  according  to  which  the 
functions  are  updated  are  as  follows: 

1)  If  [tj,t2]  is  a  transmission  interval  from  node  j 

1  j 

to  node  i  the  contents  of  the  buffer  B^  at  time 
tj  are  transmitted  and  entered  in  the  buffer  B„  at 
time  t^,  i.e. 


A 

jj 


(6) 


The  shortest  path  problem  above  is  representative 
of  a  broad  class  of  dynamic  programming  problems  which 
can  be  viewed  as  special  cases  of  the  fixed  point  prob¬ 
lem  (2)  and  can  be  correctly  solved  by  using  the  dis¬ 
tributed  algorithms  of  this  paper  (see  [3]). 

Our  algorithmic  model  can  be  described  in  terms 
of  a  collection  of  n  computation  centers  (or  proces¬ 
sors)  referred  to  as  nodes  and  denoted  l,2,...,n.  The 
set  X  is  partitioned  into  n  disjoint  sets  denoted 

X,,...,X  ,  i.e. 

1  n 

n 

X  =  U  X.  ,  X  rxx.  =  0,  if  i  /  j. 

is!  1  J 


2)  If  [tj,t2]  is  a  computation  interval  for  node  i 
the  contents  of  buffer  B^  at  time  t,,  are  replaced  by 

the  restriction  of  the  function  T(Jj  )  on  X^  where, 
for  all  t,  J?  is  defined  by 


J-(x)  = 


J?.  (x)  if  xeX. 

11  1 

(x)  if  xeXj ,  j  t  i 


(7) 


In  other  words  we  have 


Each  node  i  is  assigned  the  responsibility  of  comput¬ 
ing  the  values  of  the  solution  function  J*  [c.f.  (1), 
(2)]  at  all  xeXp 

At  each  time  instant,  node  i  can  be  in  one  of 
three  possible  states  compute,  transmit,  or  idle.  In 
the  compute  state  node  i  computes  a  new  estimate  of 
the  values  of  the  solution  fraction  J*  for  all  xeX. . 

l 

In  the  transmit  state  node  i  communicates  the  estimate 


*2  ^1 

J.^(x)  =  T(J.1j(x),  V  xeX.  (8) 

3)  The  contents  of  a  buffer  B^  can  change  only  at 

the  end  of  a  computation  interval  for  node  i.  The 
contents  cf  a  buffer  B„  ,  j  /  i  can  change  only  at  the 

end  of  a  transmission  interval  from  j  to  i. 
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Additional  conditions  under  which  ther  holds 
ia  J*(x)  *  J*(x).  Y  xeXi(  i  «  X,...,n  (9) 

•M 

*y  be  found  in  [2] ,  [3] .  An  inte-esting  aspect  of 
esults  of  this  type  is  that  they  do  not  require  that 
:he  initial  processor  buffer  contends  be  identical  and 
.ndeed  these  initial  conditions  can  vary  within  a 
■road  range.  This  means  that  for  problems  that  are 
■eing  solved  continuously  in  real  time  it  is  not  neces¬ 
sary  to  reset  the  initial  conditions  and  resynchronize 
:he  algorithm  each  time  the  problem  data  changes.  As 
i  result  the  potential  for  tracking  slow  variations  on 
:he  solution  function  is  improved  and  algorithms  imple- 
■entation  is  considerably  simplified. 
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Abstract.  It  has  bean  shown  In  previous  wo.k  that  generalized  fu2zy  set  theory  and  infinite- 
valued  logic  provide  t:  systematic  approach  to  the  modeling  and  use  of  both  natural  language 
and  numerlcal/statlstical  Information  which  occurs  In  the  tracking-data  association  and  re¬ 
lated  problems.  This  paper  continues  efforts  In  establishing  connections  between  these  disci¬ 
plines  and  classical  probability  theory.  It  has  been  shown  that  over  discrete  Spaces,  prob¬ 
abilistic  concepts  are  all  special  cases  of  generalized  fuzzy  set  ones.  Conversely,  many 
fuzzy  set  systems  ccn  be  shown  to  be  natural  extensions  of  ordinary  set  operators  through 
Isomorphic-like  relations  with  corresponding  random  set  representations  via  one  point  coverage 
functions. Among  the  new  results  presented  here.  It  Is  r.hown  that  any  fuzzy  set  membership 
function  has  naturally  compatible  rs.,Jom  set  and  random  variable  representations.  In  the 
latter  case,  the  membership  function  Is  the  same  as  the  evaluation  function  of  the  (non- 
unique)  corresponding  random  variable  over  a  suitably  chosen  collection  of  compound-  and. 

In  general,  overlapping-  sets  or  events.  An  application  to  the  classification  problem  Is 


presented. 


INTRODUCTION 

Although  some  Investigators  In  the  field  of  cognitive 
rationality  have  come  to  the  conclusion  that  rational 
decision  making  by  humans  Is  unobtainable  [1],  It  is 
the  optimistic  belief  of  many  others  (see.e.g.,  Cohen 
£2])  that  intuition,  properly  systematized,  can  serve 
as  a  basis  for  the  cholo;  of  a  normative  theory  of 
decision  making.  In  turn,  the  latter  depends  critically 
upon  which  fundamental  theory  or  theories  of  uncertain¬ 
ties  and  beliefs  to  accept.  Fine  (3]  considered  vari¬ 
ous  approaches  Involving  classical  and  subjective 
probability  theory  with  underlying  emphasis  on  model¬ 
ing  random  variables  rather  than  random  sets.  More  re¬ 
cently,  Freellng  [4]  has  analyzed  some  nonstandard 
models.  Including  upper  and  lower  probabilities  and 
the  Dempster/Shafer  theory  of  belief,  Zadeh's  possibil¬ 
ity  theory,  Cohen's  Inductive  probabilities  and  second 
order  Bayesian  probabilities,  as  well  as  Shackle's 
degree  of  surprise  approach.  However,  without  a  single 
basis  which  can  be  used  for  comparisons,  Freellng  has 
not  been  able  to  demonstrate  any  deep  structural  rela¬ 
tionships  among  the  various  schools  of  thougnt,  and 
consequently,  comparisons  are  limited  quantitatively. 
Manes  [5]  has  proposed  a  formal  axiom  system  for  model¬ 
ing  uncertainties  which  generalizes  Zadeh's  original 
fuzzy  set  theory  and  Is  based  on  the  Klelsll  category 
theorem  for  monoids.  (However,  It  does  not  capture  a 
large  class  of  generalized  fuzzy  set  systems-see  [6].) 
In  addition.  Manes'  system  extends  Oempster-Shafer, 
topological  neighborhood,  credibility,  and  other 
theories. But  again.  It  must  be  emphasized  that  this 
general  theory  Is  considering  formal  similarities  of 
various  uncertainty  approaches,  not  their  Internal 
structures  and  their  relationships  to  each  other.  In 
addition,  there  Is  the  ongoing  controversies  between 
the  proponents  of  the  various  approaches  viewed  some¬ 
what  parochially  from  their  respective  stands.  For 
example,  see  Goodbnan  [7]  for  a  listing  of  argumenta¬ 
tions  between  fuzzy  set  adherents  and  Bayesian  oriented 


Individuals,  "r,  see  the  L1ndley"paradox"  discussions 
between  Dempster-Shafer  and  Bayesian  backers  (8). 
Polemics  aside,  techniques  will  have  to  be  developed 
which  In  some  reasonable  sense  model  uncertainties  as 
faithfully  as  possible  so  that  applications  to  decision 
making  may  be  carried  out  on  both  the  basic  Intuitive 
level  and  normative  rigorous  level,  as  discussed  by 
Llndley  and  Brown  In  the  reconciliation  of  Incoherent 
data  (9). 

Random  set  theory  and  Its  modifications  (such  as  through 
equivalence  classes  of  random  sets)  could  well  provide 
a  key  to  a  meaningful  analysis  of  the  myriad  approaches 
to  the  modeling  of  uncertainties  by  the  establls!  <ent 
of  methematlcally  rigorous  relationships  between  the 
theory  of  perception,  natural  language  descriptions, 
multivalued  truth  and  sat  theory,  and  the  many  schools 
of  thought.  The  Intuitive  basis  for  the  use  of  random 
sets  appears  only  slightly  more  complicated  than  that 
for  ordinary  random  variables.  Indeed,  as  the  new  re¬ 
sults  of  this  paper  show,  there  Is  a  natural  relation¬ 
ship  between  random  set  and  random  variable  representa¬ 
tions  of  fuzzy  set  membership  functions. 

SUMMARY  OF  PREVIOUS  RESULTS 

In  this  section  a  brief  description  of  previous  results 
relating  random  set  theory  and  fuzzy  set  theory  as  well 
as  other  areas  Is  presented.  For  basic  definitions  and 
background,  see  FI0J,mj,[12),[7),H3J 

Theorem  1.  C7],[103 

Any  fuzzy  subset  A  of  ordinary  set  X  has, In  general, 
many  one  point  coverage  equivalent  random  set  represen¬ 
tations  S(A),  where  S(A)  Is  a  random  subset  of  X.That  Is 

»A(x)  »  Pr(xe$(A))  ;  all  xeX.  (1) 

Two  examples  of  this  Include  the  nested  random  set  Sy(A) 
and  the  very  broken-up  random  set  T(A),  where 


,/'  I 


■*  10,1]  Is  the  Membership  or  possibility  function 

for  A  and  , 

Sjj(A)  *  *A  [U.l]  .  O) 

while  the  random  membership  function  for  T(A)  Is 

*T(A>  “  {fT(A){x,,xeX  *  (3) 

where  each  *T^j(x)  Is  a  statistically  Independent 

zero-one  random  variable  with, for  all  xcX 

Pr(fr(A)^’1^“Vx);Pr^T(A)M"°)‘1**A^-  ^ 

Conversely,  any  random  subset  S  of  X  has  for  Its  one 
point  coverage  function  Pr(xeS)  as  a  function  of  x,  a 
possibility  function  for  some  corresponding  unique  fuzzy 
subset  A  of  X. 

Thus  the  fuzzy  subsets  of  a  given  space  X  partition  It 
Into  disjoint  and  exhaustive  parts,  each  part  corres- 
ponding  to  all  random  subsets  of  X  having  a  common  one 
point  coverage  function:  the  membership  function  of 
any  fixed  fuzzy  set.  ^ 

Theorem  2.  CIO] 

S„  and  T  as  mappings  from  the  collection  of  all  fuzzy 
subsets  of  a  given  space  Into  the  collection  of  all 
randoa  subsets  of  the  same  space  Induce  Isomorphic- 
like  relations  between  various  fuzzy  set  operators  and 
corresponding  random  set  operators-I.e.,  ordinary  set 
operators  on  random  subsets  of  the  sane  space.  For 
example,  where  «  Indicates  one  point  coverage  equlval.: 

^(SjjfA))  Su(f(A)J  ^f(A)  (5) 

SjjWVSytB)  a  Sjj(AUB)  nsAUB  (6) 

S^AjnSyfB)  *  S^AOB)  ftArtB  (7) 

X-«U(A)  a  Su(XiA)  ft  X-A  (8) 

proj(Su(A)J  ft  Su(proj(A)J  ft  proj(A)  ,  (9) 

all  fvivy  svk*rtsuA.V  of  usp»c«. 

Indeed,  Sj.  possesses  even  stronger  properties  In  that 
the  left-  sided  eaulvatences  can  be  replaced  by  actual 
equalities  for  eqs.(5),(6),(7),  and  (9).  Similar  results 
hold  for  T{ - )  ,  but  without  any  of  the  stronger  replace¬ 
ments  of  one  point  coverage  equivalence  by  equality. 

In  the  case  of  S„  »  the  fuzzy  set  operators  for  func¬ 
tional  transform”  f:X  *1,  unions  U,  Intersections  n, 
complements  X-,  •  ,  and  projections.  Involve  the  fuzzy 
set  system  F0“(*n,*..*orMl-(.).m1n,max),  for  comple¬ 
mentation,  Intersection,  and  union.  In  the  case  of  T, 
the  fuzzy  set  system  F.>(l-(*),prod,probsumj  Is  used 
for  all  definitions.  1  B 

More  generally,  fuzzy  set  system  F»(*  1  consists 

n  n  or 

of  a  triple  of  operators,  the  first  being  an  Involution, 
the  second,  a  t-norm,  and  the  third,  a  t-conorm  1101,1191 
Based  on  Infinitely-valued  set  theory,  ordinary  set 
and  logical  concepts  valid  In  classical  set  theory  and 
classical  logic  may  be  extended  to  a  fuzzy  set  form, 
for  any  choice  of  fuzzy  set  system.  For  example,  con¬ 
sider  as  above  f:X  ♦  T  and  any  ycY  and  any  fuzzy  subset 
A  of  X  : 

♦f(A)(y)  *  truth (yef(A)) 

•truth(0  xXtxcA)S(y-f(x))) 

•truth(  OrfccA)  4 <*ef -1  (y } } )) 
xcX 


•truthf  0r(xcAnf*1{y))) 
xeX 

•  *or(  truthJxcAflf'^y)}) 


♦^{truthKxeAWxcf-^y))}) 


’  ♦or(Vtruth(J"A) ,truth(xef  !(y) ) ) ) 
xcX 

"  ♦or(Wx),V1(y)(x))) 

XcX 

*  *or^Aftfl{y)fx^ 
xcX 

-  *0P  £♦*(*))  do) 

x*fl  fy) 


Note  that  when  F*F’  (see  Theorems  5(e)  and  b(bj;,  me 
above  result  reduces  to  the  classical  form  for  the 
transformation  of  probability  under  f.(See  also  (13).) 

Extending  the  concept  of  a  single  one  point  equivalence 
map-as  are  S„  and  T,  among  others-  Is  that  of  a  choice 
function  family.  This  Is  a  collection  of  Identical  maps 
S,  from  the  collection  of  all  fuzzy  subsets  of  any  space 
to  that  of  all  random  subsets  of  the  same  space,  such 
that  for  any  spaces  *nd  any  fuzzy  subsets  A,. 

...A  of  X.,..,X  ,  respectively,  the  correspondlna 
one  npo1nt1equ1v8lent  random  subsets  SjtA.J.-jS  (A  ) 

have  a  well-defined  distribution,  or  equivalently,  form 
a  well-defined  joint  collection  of  random  sets.  Two 
Important  such  families  are  :  S„  *  (S„  ,  -  .where 

It  u, 

Uj.Uj...  form  a  J-copula  stochastic  process  [101  and 
S^,  formed  from  repetitive  applications  of  ,  a  seml- 

dlstrlbutlve  t-normUOT. 

Theorem  3.  (101.H31.U53 

Theorem  2  can  be  extended  In  a  natural  way  for  the  two 
choice  function  families  described  above. 

However,  It  can  be  shown  that  there  exist  fuzzy  set  oper¬ 
ations  which  have  no  random  set  representation  through 
any  possible  choice  function  family.  (See  U3J.)  never¬ 
theless,  all  of  the  basic  fuzzy  set  concepts  and  their 
generalizations.  Including  systems  F_,  F,,  and  many 
others,  have  complete  random  set  representations. (Again, 
see  [13].)  If  a  fuzzy  set  operation  has  a  random  set 
representation,  then  that  representation  must  be  the 
restriction  of  the  fuzzy  set  operation  to  ordinary  sets. 

Theorem  A.  1103.(131.1151  ■ 

Given  any  choice  function  family  and  any  ordinary  (n- 
ary)  set  operation,  there  exists  a  unique  extension  to 
fuzzy  sets  such  that  a  random  set  representation  exists 
relative  to  that  choice  function  family.  In  particular, 
the  application  to  binary  compositional  set  operations 
generates  an  easily  computable  class  of  extensions.  ^ 

One  of  the  chief  difficulties  In  carrying  out  random 
set  representations  and  Interpretations  of  fuzzy  sets 
and  their  operations  is  the  non -uniqueness  of  corres¬ 
pondence.  In  general.  Infinitely  many  random  sets  can 
(one  point  coverage  equivalence)  represent  a  given 
fuzzy  set.  One  method  of  selecting  a  single  random  set 
representation  Is  through  the  measure  of  entropy: 

Theorem  5.  (16) 

Consider  any  fixed  discrete  space  X.  Then 

(a)  The  collection  of  all  random  subsets  S(A) 
representing  A  can  be  characterized  as  being  In  a 
bljectlve  relation  with  a  simply  descrlbable  con¬ 
vex  linearly  boundec  subspace  R(A)  of  Rc  *:here 
c  d  2card(X)_1_card()()< 

(b)  The  maximal  entropy  random  subset  representing 
A  Is  T(A). 


\'L 
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(c)  The  minimal  entropy  random  subset  represent¬ 
ing  A  occurs  In  a  collection  of  random  subsets  which 
1$  bljectlve  with  the  vertex  set  P(A)  of  R(A),  a 
relatively  sparse  set. 

(d)  There  Is  only  one  nested  random  subset  of  the 
space  representing  A,  namely  SU(A).  (This  result  Is 
true  for  any  space  X,  d1screteuor  not.)  S.,(A)  al¬ 
ways  lies  In  the  class  bljectlve  to  V(A),  although 
It  Is  not  always  necessarily  the  minimal  entropy 
representation. 

(e)  Note  first,  that  all  probability  functions 
and  all  cumulative  and  reverse-cunulatlve  probabil¬ 
ity  distribution  functions ,fbr  ordinary  and  defici¬ 
ent  probability  measures,  are  fuzzy  set  membership 
functions.  Let  A  be  any  fuzzy  subset  of  X  with  mem¬ 
bership  function  ♦,  which  Is  a  probability  function. 
Then, that  random  subset  of  X  which  Is  the  singleton¬ 
valued  one  formed  from  a  random  variable  having 
as  Its  probability  funct1on,repre$ents  A  and  lies" 

In  the  class  bljectlve  to  P(A).  although  as  In  (d). 
It  does  not  always  possess  the  minimal  entropy. 

■ 

Some  miscellaneous  results  are  summarized  next. 

Theorem  6. 

(a)  A  general  fuzzy  set  version  of  the  Law  of  Large 
Numbers  has  been  established  (17).  iFor  a  fuzzy  set 
version  of  the  Central  Limit  Theorem,  see  (10). ) 

(b)  Conditional  fuzzy  sets  may  be  defined  anala- 
gous  to  those  In  classical  probability  spaces  over 
discrete  spaces  [17] ,  In  turn,  yielding  a  fuzzy  set 
form  of  Bayes'  theorem  for  general  fuzzy  set  systems 
F  (131, (17).  This  may  be  used  to  develop,  from  first 
principles,  a  decision  theory  based  upon  fuzzy  set 
concepts.  In  particular,  fuzzy  set  membership  func¬ 
tions  may  be  used  to  model  error  distributions  of 
attributes  and  Inference  rules  connecting  grouos  of 
attributes  with  parameters  of  Interest.  The  rules 
can  Include  statistical  tests  of  hypotheses  converted 
Into  fuzzy  set  form  by  considering  their  random  sig¬ 
nificance  levels  as  fuzzy  set  membership  functions 
of  the  test  statistic  values.  This  Information  may 
then  be  combined  with  observed  data  to  yield  a  post¬ 
erior  fuzzy  set  describing  the  parameters  of  Inter¬ 
est.  (See  (201,  (211  for  applications  to  the  taroet 
data  association  problem-PACT  algorithm.)  When  all 
fuzzy  sets  Involved  In  the  decision  problem  are  also 
probability  functions  and  the  system  F»F'  ■ 
(l-(.),prod,bndsum)  Is  chosen,  then  all  of  the  above 
results  reduce  to  ordinary  probabilistic  ones. 


(c)  Characterizations  have  been  obtained  for  ran¬ 
dom  Interval  representations  of  fuzzy  sets  (16). 

(d)  The  Oempeter-Shafer  theory  has  been  shown  to 
be  completely  analyzable  through  random  set  theory- 
in  particular,  by  use  of  Choquet's  Capacity  theorem 
concerning  subset,  superset,  and  Incidence  functions 
of  random  sets  as  extensions  of  the  usual  properties 
of  probability  measures  In  expanding  In  alternating 
sums,un1ons  of  events  In  terms  of  Intersections,  and 
v1ce-vetsa.(See  (111,(71.) 


’e)  Although  Eytan's  claim  that  Zadeh's  version  of 
fuzzy  set  theory,  considered  as  a  formal  category, 
has  been  pointed  out  by  Pitts,  Carrega,  and  Ponasse 
not  to  be  a  topos  (see  (223, (03], (241),  Mag's  topos 
(as  discussed  In  (2SJ)  promises  to  be  a  fruitful 


medium  for  extending  the  concept  of  membership  In  a 
fuzzy  set  to  simultaneous  membarshlp  of  several 
points  (In  a  Higgs  set)  slncefa  sound  and  complete 
Intultlonlstlc  logic  can  be  realized  through  the 
topos:  It  Is  a  natural  extension  of  the  concept  of 
a  set,  and  Indeed  a  fuzzy  set  (  for  single  point 
membership)  and  Is  conducive  for  axlomlzatlons 
(although  see  Goguen's  earlier  axlomlzatlcn  of 
Zadeh’s  fuzzy  set  theory  as  a  category  (261  )*ano 
finally.  It  can  be  shown  to  be  the  quotient  object 


completion  and  union  of  e  class  of  natural  Imbeddings 
of  various  forms  of  Zadeh's  fuzzy  set  theory  for  multi¬ 
ple  memberships  (221  ,D7 1,  Apropos  to  multiple  point 
membership  of  fuzzy  sets.  Theorem  5  (a)  has  been  ex¬ 
tended  to  the  characterizations  of  all  random  subsets 
of  X  which  are  multiple  point  coverage  equivalent  to 
a  given  multiple  point  membership  function,  up  ti  -„iy 
prescribed  multiplicity  level  (161.  ^ 

Finally,  we  mention  the  fundamental  result  previously 
obtained  concerning  how  evidence  should  be  combined. 

Theorem  7.  (19 1,(201:  (If  J,  for  asymptotic  properties. 

Given  any  collection  of  fuzzy  subsets  A., ...A  of  a 
space  X  describing  some  common  unknown  parameter 
vector,  for  each  choice  of  non-decreasing  combining 
function  g:fO,ll*-r  (0,1J  with  respect  to  any  confi¬ 
dence  levels  of  the  corresponding  level  set  forms, 
there  Is  a  unique  single  fuzzy  set  description  of  the 
parameter  which  minimally  contains  the  Information 
given  by  A, ,..,A .  This  fuzzy  set  A  Is  determined  by 
the  equation  "  ^1} 

♦»(x)  d  g (♦,  (x)...,0.  (x))  ;  all  xtX. 

"  t  "n  ■ 

By  noting  that  any  t-nona  satisfies  the  conditions  for 
g  In  the  above  theorem,  an  application  may  be  made  to 
a  large  class  of  data  association  problems,  where  It 
can  be  shown  that  the  fuzzy  Bayesian  solutions  of  the 
problems  coincide  with  the  combination  of  evidence 
approach  determined  by  eq. (11) .  (See  the  remarks  In 
Theorem  6(b)  concerning  the  PACT  algorithm.) 

FUZZY  SET  MEMBERSHIP  FUNCTIONS  AS  EVALUATIONS 

OF  RANDOM  VARIABLES  OVER  COMPOUND  EVENTS 

j 

The  results  of  the  previous  section  point  out  that  all 
fuzzy  sets  have  (In  general,  many)  random  set  one  point 
coverage  representations  and  that  the  most  common  fuzzy 
set  operators  also  have  Isomorphic-like  random  set  op¬ 
erator  correspondences  (under  the  one  point  coverage 
relations).  In  this  section,  we  present  a  dual  result 
with  respect  to  random  variables  over  some  Initial 
elementary  event  space.  It  Is  first  shown  (Theorem  8) 
that  given  a  random  variable  V  over  a  space  X  and  given 
any  collection  A  of  compound  events  from  X.I.e.,  A  Is 
a  collection  of  subsets  of  X  lying  In  the  o-algebra  on 
X,  a  random  subset  S  of  A  may  be  constructed  such  that 
notlonly  Is  the  one  point  coverage  function  of  S  tne 
same  as  the  evaluation  function  for  V  over  A,  but  also 
that  the  structure  of  S  Is  natural  with  respect  to  V, 

1 .ti,  any  outcome  of  S  Is  the  filter  class  of  A  over  a 
corresponding  outcome  of  V.  The  next  result  (Theorem  9) 
shows  the  converse:  given  any  random  subset  S  of  any 
given  collection  A,  an  elementary  event  space  X  for  A 
may  be  constructed  as  well  as  a  random  variable  V  over 
X  such  that  the  situation  In  Theorem  8  holds.  The  con¬ 
struction,  In  general  Is  non-unique.  Finally,  these  re¬ 
sults  are  combined  with  Theorem  I  to  show  that  any  fuzzy 
subset  of  a  space  may  be  represented  dually  (In  many 
ways.  In  general,  depending  on  the  choice  of  S)  by  both 
the  evaluation  function  of  a  random  variable  over  com¬ 
pound  events  and  as  the  one  point  coverage  function  of 
a  random  subset  of  the  same  space,  with  the  random 
variable  and  random  set  In  a  natural  relationship,  as 
mentioned  above. 

First  a  basic  lemma  and  definitions  are  presented. 

Lemma  and  Definitions. 

Let  X  be  any  space  and  ASP(X),  the  class  of  all  ordin¬ 
ary  subsets  of  X.  Let  ¥:£.-*•  X  be  any  random  variable 
relative  to  probability  space  (n,C,Pr)  and  some  measur¬ 
able  space  (X,8)  Inducing  probability  space  (X,8,Pr»V*l) 
with  o-algebra  8  5A. 

Let  S:  Q  -  P(A)  be  any  random  subset  of  A  relative  to 
measurable  space  (rng(S).o(S)),  rng»£P(A).generated  by 
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G{S)  &  {Cj#)(rng9)  |  a  e  A)  (12) 

where  the  filter  class  of  rngEJon  point  a  Is  given  by 
C{aj{rngg))  ^  (B  |  a  e  B  erngU}  ,  (13) 

for  any  a  t  A.  Mere  general ly/the  filter  class  of  rn<£) 
over  any  C  t  P(X)  Is  given  by 

Cc(rngS)  d  {  B  |  C=B  crngQ)  .  (14) 

(See  (13)  and  (281  for  background  m  random  sets.) 

Thus  S  induces  probability  space  tng(S) .ct(S) ,Pr*S_1) . 
Then:  ,  , 

V-1(a)  «  S*1(Cjaj(rngS)).a11  a  t  A  (IS) 

Iff 

S(«)  *  C(V(U)}W  .  all  u  e  n  .  (16) 

which  mutually  Imply 

Pr(Y-l(a))  -  Pr(S-l(C{a}(rngfi)))  .  (17) 

Pr(V  c  a)  -  PrU  c  S)  .  all  a  c  A.  (18) 

In  addition,  for  any  collection  of  sets  such  as  G{$),  a 
basis  ^S)  may  be  formed  as  follows: 

(1)  let  A'  A  be  such  that  C,  ,(rng(S))  is  a  one-to- 
one  function  of  a  e  A'  ,  yet 

G(S)  •  {  Cjaj(rngQ)  |  a  i  A'l.  (19) 

(2)  For  any  B  c  P(A'),  define 

»B  d  CB(rng$))-tlJ  C{a)(rngRJ.  (20) 

aeA'aB 


and  similarly  for  S,  yielding  Immediately 

Pr(V  c  })  •  Pr(a  c  S),  all  a  e  A  .  (25) 

In  addition: 

(I)  a  must  always  satlsfy-and  therefore  may  be  de¬ 
fined  by,  once  V  and  X  are  determlned- 

8  -  VtS-^jfrngB))),  all  a  .  A.  (26) 

(II)  One  choice  of  V  and  X  Is: 

V  S  S  ,  X  i  rng(S),  (27) 

In  which  case  A  reduces  to 

C{a)(rngB),  all  a  e  A.  (28) 

(ill)  The  general  solution  for  V  and  X  (and  hence 
a  via  eq.(26))1s  constructed  as  follows: 

From  the  lemu  (5)  for  each  B  c  H  let 

VB  :  S-!(BB)  *  XB  (28) 

be  arbitrary  surjective  (and  measurable),  with  Xg  arbi¬ 
trarily  chosen  non-vacuous  set  such  that  all  X_  are 
disjoint.  Then  define 

X  i  U*b  129) 

BcH 

and  define  V:0  ♦  X  by:  for  any  men,  there  Is  a  unique 
B(u)  c  H  such  that  ux  S"1  (Bg)  .whereupon  define 

V(u)  tf  all  «  e  0  .  (30) 

(1v)  It  follows  from  (111)  that  the  smallest-by 
subset  Inclusion-  possible  space  X  satisfying  the  re- 
qlred  properties  Is  of  the  form 


(3)  let 


f(S)  d  {  Bfl|  B  c  H  }  , 


(21) 


be  the  collection  of  all  non-vacuous  distinct  (and  hence 
disjoint)  Bg  *s,  noting  that  HSP(A)  may  be  non-unique. 

(4)  It  follows  that  ^?(S)  forms  a  (disjoint  and  ex¬ 
haustive)  partitioning  of  1>g(s),  where  for  any  a  c  A', 


X  *  (xB  |  B  c  H  >,  (31) 

where  each  x.  represents  ».  point  which  Is  distinct  for 
each  B. 

(v)  Finally,  note  that  In  (11),  X.  ■  Bn  ,  for 
all  B  e  H  .  *  " 


C{a}(rngB>  -  O  ®B  (disjoint).  (22) 
acBcH 

(5)  In  turn,  this  Implies  that  {S*l(Bg)|B  c  H}  Is 
a  partitioning  of  n  ,  where 

S-^Bg)  •  (S-^CJl  C  c  bbi 
ind  . 

S“((C)  ■  {u  |  u  e  <1  6  S(m)»C), 
by  the  usu.l  Inverse  functional  notation.  ^ 

Theorem 8 Random  variables  generate  naturally  corres¬ 
ponding  random  sets  whise  one  point  coverage 
functions  match  the  evaluation  functions  of 
the  random  variables  over  a  given  set  of 
compound  events. 


Proofs  :  Using  eqs.(22),(28)-(30) 

i  -  V(S-!(  Qbb))  -0  VS'1(V>  ■  U  XB  (31) 

acBcH  acBcH  acBcH 

v_1(J)  -  »_I(OV  “US_1(V 

acBcH  acBcH  acBcH 

■  S‘1(^)Bb)  -  S_1(C{a}(rngP)));all  aeA.(33) 
acBcH 

Eq. (24)  Is  obtained  from  the  Injective  property  of  a 
over  A'  and  use  of  eqs.(15)  and  (16)  In  the  leimia. 

The  remainder  of  the  results  follow  Immediately  from 
the  constructions.  m 


let  V:G  -*  X  be  a  random  variable  with  A  arbitrary,  where 
ASBSP(X)  as  In  the  lemma.  Then  define  S:0  ♦  P(A  )  by 
eq.(16).  The  lemma  Implies  eqs. (15)-(18)  hold.  ^ 

Theorem  9.  Converse  of  Theorem  8:  Any  random  set  gener¬ 
ates  a  naturally  corresponding  random  vari¬ 
able  and  an  elementary  event  space  such  that 
the  random  variable  evaluated  over  certain 
of  the  compound  events  matches  the  one  point 
coverage  function  of  the  random  set. 

let  only  A  be  given  with  some  random  subset  S:fi  *  P(A). 
Note  the  validity  of  the  development  In  eqs.(19)-(22). 
Then  there  exists  a  space  X  (corresponding  to  elementary 
events),  a  random  variable  V:R  ♦  X  surjective,  and  a 
mapping  A:A  ♦  P(X)  which  Is  Injective  over  A'  such  that 

rh»)  *  S-^C^jJrngfl)),  all  a  c  A  (23) 

and  equivalently, 

S(w)  *  c{y(„)}(^)  ,  all  u  c  n  ,  (24) 

where  as  usual  A  i  (a  |  a  t  A  }  In  functional  form, 


Theorem  10.  Any  fuzzy  subset  of  a  space  has  both  a 
random  subset  representation-under  one 
point  coverages,  and  a  random  variable 
representation  as  the  evaluation  function 
of  the  random  variable  over  hm  class  of 
compound  events,  with  the  random  subset 
and  random  variable  naturally  related. 

let  8  be  any  fuzzy  subset  of  some  given  A.  Thus, 

♦B:A  ♦  (0,1J  Is  arbitrary  given.  Then  by  Theorem  1, 

there  always  exists  random  sets  S:0  *  P(A)  which  are 
one  point  coverage  equivalent  to  8.  In  turn,  applying 
Theorem  9,  there  Is  a  random  variable  V:n  X  ,  with 
V,  X,  and  mapping  A:A  ♦  P(X)  all  satisfying  the  results 
in  Theorem  9.  In  particular,  this  Implies 

$„(a)  *  Pr(a  i  5)  *  Pr(V  c  f),  ell  i  c  A.  (34) 

6  ■ 

Analogues  to  Theorems  2,3,4  for  random  variable  repre¬ 
sentations  of  fuzzy  sets  have  yet  to  be  Investigated. 
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Example  Illustrating  random  variable  and  random  set  rep¬ 
resentation  of  fuzzy" sets. 

In  real  world  applications,  classes  of  objects  are  deter¬ 
mined  by  »•  >y  factors  where  often  we  have  no  knowledge 
of  the  actual  conditional  and  joint  probabilities  In¬ 
volved  a -  mg  the  factors  (let  alone  determine  all  of  the 
relevant  factors).  In  such  cases.  It  may  be  both  simpler 
and  more  appropriate  to  query  experts  directly  as  human 
Integrators  of  knowledge  to  obtain  the  probabilities  or 
possibilities  of  occurences  of  t’:e  classes  Involved  and 
their  error  distributions.  Results  from  these  experts 
show  apparent  probabilities  for  these  events  not  adding 
up  to  one  In  general.  In  the  past,  normalization  was 
carried  out  to  make  these  values  1nto"1eg1t1mateuprob- 
ablllty  ones,  with  the  Implication  that  human  error  was 
Involved  in  the  estimates  which  contributed  to  this 
problem.  However,  since  In  general,  the  classes  of  In¬ 
terest  really  represent  overlapping  concepts,  and  not 
disjoint  events,  the  computed  values  should  not  be  ex¬ 
pected  to  sum  to  unity.  Hence,  a  fuzzy  set  membership 
(or  possibility)  function  Is  more  appropriate  than  a 
classical  probability  function  model.  Indeed,  Theorems 
8-10  make  this  idea  more  rigorous: the  experts’  respon¬ 
ses  (suitably  averaged)  form  the  fuzzy  set  membership 
function  which  Is  the  same  as  the  evaluation  function 
of  a  (non-unique)  random  variable  representing  the 
actual  elementary  event  space  of  all  factors  evaluated 
at  those  compound  events  of  that  space  which  corres¬ 
pond  to  the  classes  considered.  The  evaluation  function 
may  also  be  Interpreted  as  the  connonalltles  or  one 
point  coverage  function  of  a  (non-unique)  random  subset 
of  the  set  of  all  classes-  the  random  subset  represent¬ 
ing  the  possible  Interactions  of  the  classes. 

First,  for  simplicity,  consider  the  situation  described 
In  Theorem  8:no  experts  used  and  all  prob.  values  known: 

Let  Gjd  {a.b.c},  G^d.II},  G^  {1.2},  be  the  domain 

sets  of  the  only  factors  considered  for  possible  class¬ 
es.  For  example,  G.  could  represent  lengths,  G.,  weights 
(heavy,  light), and1G,,  shapes  (of  type  1  or  type  2). 
Define  then  the  classes  C.,..,C,  by  the  following  table: 


I  Class  C. 


Gjj*Gj  values 


jG2j»G2  valueslGjj-Gj  values 


— 

a,b 

I 

1.2 

C2 

b.c 

I 

1 

C3 

«.C 

II 

1.2 

Ca 

a,b 

II 

1,2 

C5 

a.b.c 

II 

1 

C6 

c 

I 

2 

Table  1. 

Define  the  following  elementary  events -possible  triples 
of  values  from  G{,G7,G3  that  determine  the  classes: 

Xj*(a,1 ,1 )  |x4*(a,II,2)  x7-(b,II,l)  Xj0-(c,I,2) 


x4*(a ,11,2) 

x7-(b,II,l) 

x5-(b,I,l) 

x8-(b,II,2) 

Xg»(b,I,2) 

Xg* (c ,1,1) 

Thus  we  can  tabulate  for  eacn  class  which  elements  are 
In  It.andiConversely,  for  each  element.whlch  classes 
contain  It: _ _ 

Class  Cj  Elements  In  C,  Class  Cj  Elements  In  C^ 

C  *2’  *5’  *6  C4  *3’  x4*  x7’  x8 


5*  9 

x3*  x4,xll,x12 


"  »6 
Table  3. 


Here,  X»{Xj,x2,..,Xj2)  ,  A»(Cj,..,Cg}£P(X)  ,  and 
the  collection  of  all  classes  containing  Xj  Is  Cjxjj(A). 


x3ltC3,C4,C5 


C{Xj)(A}  *j 

VA1 

XJ 

VA1 

(C3,t4)  x? 

{c4.cs> 

X10 

{Cj ,C2>  Xg 

tc4} 

*11 

{c3.cs} 

{C1}  P*g 

tc2) 

x12 

{C3> 

Next,  let  V,  be  a  random  variable  over  G,  corresponding 
to  the  trueJ value  of  factor  j  and  V,  a  random  variable 
also  over  G,  corresponding  to  the  observed  value  of  fac¬ 
tor  j,  for  Jj»l,2,3.  Suppose  also  that(¥, .V-.V.Jare  a 
mutually  statistically  Independent  triple,  is  are  : 
(?l,?2,y j).(V1,f2,l3).(tl.»r»3).  and  (fj.^.Vj).  Sup- 

further  that  the  probability  functions  p(V. |9.)  and 
p(9.)  ,  J*l,2,3,  are  all  knov  ,  where  we  use  J  the  con¬ 
vention  of  Identifying  ,  where  necessary,  random  varia¬ 
bles  and  their  probability  functions  through  their  typ¬ 
ical  outcomes.  Define  random  variable  V^(Vj,V2,V3) 

corresponding  to  the  true  joint  factor  values  determin¬ 
ing  the  true  class,  and  similarly,  vR(9j,v?,v,)  for  the 
observed  (or  reported)  class.  Then  It  followsJ  that: 

P(V  |  V  e  ck)  - 

fTTZU  p(v,|v.»t).p(v,«t^  /ITT]  p(v.»t)  Os, 
y-i  t£Gjk  3  3  3  *  ’  j-i  usjk  3 

and  •  A 

ptt^  Is  true|Ck  Is  observed )*p(¥eCjVeCk;fl4B(ej¥eCk)  « 

P(Vi*t'|V,-t).p(V.»t))  / 

^■l/tcGjkS  J  i  j  /'  J  (36) 

Thus,  p(Cm  true|Ck  observed)  Is  a  computable  function  of 

C  for  each  C  In  A  and  may  be  Interpreted  as  the  fuzzy 
set  membership  function  or  possibility  function  for 
possible  choices  of  which  class  gave  rise  to  the  obser¬ 
vation  Ck,  for  any  fixed  k  ,  k«l,2,..,6.  These  functions 
are  generated  by  the  conditional  random  variables 
(¥  |  V  c  C. )  evaluated  over  the  collection  of  compound 
events  A.  In  turn,  the  possibility  functions  are 
also  represented  as  the  one  point  coverage  functions  of 
the  random  subsets  (S  |  7  e  C. )  of  A  which  represent 
the  possible  Interactions  of  the  classes. k*l,2,.., 6. 
Here,  for  any  k,  a  typical  outcome  of  (S  |  ¥  e  Cfc)  Is 
some  c.  i ( A) ,  the  collection  of  all  classes  C. ,  C-,.. 


_ Table  4. 

be  a  random  variable  c 


some  C^x  j(A),  the  collection  of  all  classes  Cj, 

such  that  they  interact  with  respect  to  x,  ,  l.e. 
tain  x,.  Table  4  presents  the  ten  distinct  such  c< 
Ions  Jof  classes  making  up  the  range  of  (S  |  ¥  t 


l.e.,  con¬ 
ch  collect- 


The  probability  function  for  this  random  subset  Is  ob¬ 
tained  from  eq.(16).  Some  of  the  values  (see  Table  4  ): 

p(S-{C.)  |  tf  e  C.  )  -  V  (  p(¥-x.  |  ¥  c  Ck))  , 

1  *  i-1,2,6 

p(S-{C3,C4,C5)  j  t  a  Ck)  -  p(¥-x3  |  ¥  c  Cfc), 

p(S-{C3,C4}  |  ¥  c  Ck)  -  p(V»x4  I  l  «  Ck), 
p(S-{Cj,C2)  |  V  £  Cfc)  «  fKv**c  I  V  e  Ck), 
p(S«{Cg  |  5  £  Ck)»  p(¥-x10  |  ¥  £  Ck),  etc.,  (37) 
which  may  be  evaluated  through  eq.(35). 

Now  consider  the  converse  of  the  above  sltuatlon-where 
here  Theorem  10  Is  appropriate:  Ke  know  the  classes  of 
relevance  A  •  {C.,..,Cg},  but  we  do  not  know  all  the 
contributing  factors  and/or  we  do  not  have  a  handle  on 
all  the  required  probability  functions  Involved  In  the 
factors;  but  we  do  have  before  us  a  panel  of  experts 
who  will  give  the  possibilities  of  true  class  values 
given  observed  ones.  To  simplify  notation,  assume  that 
a  particular  Ck  Is  chosen  and  all  results,  as  above, 
are  condition  id  upon  the  event  ¥  c  Ck.  Suppose  then 


the  possibilities  of  a  particular  class  being  the  true 
one,  given  Ck  Is  observed,  is  determined  by  the  panel 
to  representK1n  effect  the  fuzzy  subset  B  of  A  which  Is 
given  through  the  membership  (or  possibility)  function 
0-  :  A  ♦  (0,11,  where  without  loss  of  generality 
8  0  <  ^(Cj)  <  fB(C2)  <•••<  ♦gtCg)  -  1  .  (38) 

Ke  first  must  choose  a  random  subset  S  of  A  (such  as  In 
eq. (37)-but  here  we  do  not  know  a  priori  the  random  var¬ 
iable  V  generating  S  ard  *B).  Guided  by  Theorem  1  and 
perhaps  by  Theorem  5,  by  minimal  entropy  or  nested 
random  set  form  requirements,  choose  S  ■  Sy(B),  e,g.  In 
this  case,  the  probability  function  for  S  is,  by  defin¬ 
ing  for  m»l,.., 6, 

C  -{Cm'Cm+l’— C6}  *  ♦B«V0*(39) 


p(S-C(m))-fB(C|||)-fB{Cm_1),  m*l,..,6, 
ThuS.rng(S)  -  (C("'>!m-l,..,6}Cp(A)  and  A-A’  with 

*  ««  a  «  a 


(40) 


C{C  }(mgS) 

£( S)  -  {  c(1),c(2),..,c(6h. 

Noting  here  for  S:fl  ♦  P(A),  we  may  choose  0  ■  [0,1], 
with  U(u)  ■  u,  for  all  u  e  a.  Then  It  follows  that  S 


whence 


(41) 

(42) 


r*icM) 


(♦B(C_  i),4b(c_)];  (43) 

and  random  variable 

(surjective)  V{m) z ) ,*B (Cm) J  -  X(b)J(44) 
with  each  X,  .arbitrary  disjoint  nonvacuous  setjwlth 

W  ,  f 

V  defined  from  the  Vjbj*s  *s  In  eq.(30),and  xS&X(m)  *• 

and  the  Injettlve  mapping  A:A  •*  P(X)  Is  as  In  eq.(32), 
with  eq.(34)  holding  with  a  replaced  by  any  C  e  A.  Alsq 
all  of  the  remarks  In  Theorem  9  may  be  applledrto  the 
situation  here,  such  as  choice  of  S«V  and  X«rng(S),etc., 
as  special  cases. 

SUMMARY  AND  CONCLUSIONS 

Natural  connections  have  been  established  between  fuzzy 
set  membership  functions  and  both  evaluation  functions 
of  random  variables  over  comoound  events  and  one  point 
coverage  functions  of  random  subsets  representing  Inter¬ 
actions.  This  allows.ln  particular,  two  essentially  e- 
qul valent  views  to  be  taken  from  a  C3  systems  approach: 
uncertainty  modeling  may  be  carried  out  In  either  a  fuz¬ 
zy  set  or  probabilistic  framework,  whichever.  Is  most 
feasible  or  appropriate,  and  that  Information  may  be 
combined  within  an  all-probabilistic  or  all-fuzzy  set 
model  for  further  analysis.  For  example.  In  modeling 
natural  language  descriptions .often, Infinite- valued  lo¬ 
gic  and  Its  realizations  through  general  fuzzy  set  theo¬ 
ry  are  easier  to  use,  rather  than  probability  theory 
dlrectly-al though  all  results  may  be  translated  Into  the 
Tatter.  For  example,  consider  the  sentence :S®  "John  be¬ 
lieves  the  following  Information  must  be  sent  to  the 
commander  rover  the  oast  24  hours,  most  of  the  ten  or  so 
detections  were  of  gray  and  unusually  long  ships."  Then, 
denoting  truth  evaluation  bv  tr(»)  (see  also  eq.(10)): 
tr(S)«*.(tr(a),tr(b))  ;  b<*  mostfc  Is  d fi  X"  set  of  tjtj 
tr(a)»tr(3  bel.  b  shld.be  sent  to  comnO 

•♦bel|J(,Jor5*i(*send,a(,>*co,nn)i;Vry(*obl1g(a))i)] 
cB10  or  so  det.  mrae  over  pst.24  hrs;dflgry.  4  unus.long.. 
For  all  xeX,  denoting  all  time  Indices  by  t  and  t': 
♦D(x)-tr(x  det.  over  pst^hrsV^U^txigt^ 

”  ‘ ,w- 

1  n  tr(c4d)«g(E), 

♦F(x)“tr(x  det.ovr.pst.24hrs.4  x  is  gray  &  unus,  long) 
-♦&(*„(x)  ,*aray(col  (x ) )  .♦^ry(fl0T1Jlngth(x ) ) ) ) . 

This  determines  the  conditional  tr(d(c)  through  relation 

In  turn  tr(cld)  -  ♦.(tr(c),tr(d|c)). 

In  turn,  tr(b)  .  ♦Jnost (4tr(d | c) ) , 

which  completes  the  evaluation  of  tr(S).  This  may  also 
be  expressed  In  probability  form, by ,e.g. (Theorem  3)  re¬ 
placing  all  4A(x)  by  xeS^Ajfy  by'4(  *or  by 'or*  etc. 
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Introduction 


■^The  conventional  way  to  estimate  quantities  for 
decision  making  purposes  Is  to  employ  Informed 
judgment  about  the  applicable  estimating  relationships 
(equations,  algorithms,  models)  and  the  values  of 
their  constituent  parameters.  More  often  than  not, 
one  or  more  key  parameters  are  not  known  with 
certainty,  and  Informed  judgment  takes  the  form  of  a 
subjective  point  estimate  {"guesstlmatfc*).  Many 
practitioners  are  content  to  use  the  resulting  point 
estimate  for  the  result.  The  more  diligent  will  re-do 
the  calculation  combining  estimates  of  upper  and  lower 
bounds  of  the  Input  parameters  to  estimate  upper  and 
lower  limits  for  the  result.  Such  dllllgence  is 
facilitated  by  the  Increasingly  convenient  spread 
sheet  computational  packages  for  personal  computers. 

This  paper  describes  software  for  Incorporating 
uncertainty  directly  and  explicitly  Into  the 
calculations  to  yield  a  probability  distribution  as 
the  result  rather  than  a  set  of  point  estimates.  The 
software  permits  the  user  to  represent  the  uncertain 
parameters  by  any  of  about  sixteen  probability  density 
funct1ons.r=^Th1s  Is  done  with  simple  assignment 
statements  11ke>-^_ 

'A'  IS  *N0R  10  l7 

which  represents  the  parameter  A  as  being  normally- 
distributed  (bell-shaped  curve)  with  a  mean  of  10  and 
a  standard  deviation  of  1.  The  parameters  represented 
by  probability  distributions  rather  than  point 
estimates  are  called  probabilistic  variables.  The 
software  combines  the  probabilistic  variables  In 
expressions  almost  like  those  Involving  deterministic 
variables,  except  that  Instead  of  using  the  familiar 
arithmetic  operators  {♦,  -,  x,  ♦,  *)  corresponding 
primitive  functions  like  PL  ("Plus",  for  addition),  MI 
("Minus",  for  subtraction),  TI  ("Times",  for 
multiplication),  01  (“Divided  by",  for  division),  and 
EX  for  exponentiation  are  substituted.1  Thus,  the 
expression 

D«(A  +  B)c, 

Is  written  as 

D*— (A  PL  B)  EX  C  , 

assuming  that  A,  B  and  C  have  all  been  assigned 
definitions  as  probabilistic  variables.  And  the 
result,  0,  Is  Itself  a  probability  distribution  rather 
than  a  point  estimate. 

A  probability  distribution  provides  much  more 
Information  than  a  single  estimate,  or  even  an 
estimate  with  upper  and  lower  bounds.  For  example, 
the  P-th  ..entile,  the  value  which  Is  greater  than  P- 
percent  of  the  other  possible  values  of  the  result, 
can  be  found  directly  from  the  probability 
distribution  of  the  result/  Similarly,  summary 
statistics  or  graphical  representations  of  the 
probabilistic  result  can  also  be  produced.3 


Perhaps  one  of  the  most  useful  features  Is  the 
ability  to  compare  probabilistic  results  which 
represent  competing  alternatives.  Often,  the  expected 
result  (mean  of  the  probability  distribution)  Is 
Independent  of  the  amount  of  variability  (spread, 
dispersion)  of  the  result.  It  Is  not  unusual  to  find 
circumstances  where  the  alternative  that  has  the 
greatest  expected  value  also  has  the  highest 
dispersion.  Greater  dispersion  can  mean  greater  risk; 
In  fact  risk  Is  often  described  as  being  directly 
proportional  to  some  (exponential)  power  of  a  measure 
of  the  dispersion  such  as  the  variance  —  the  bigger 
the  spread,  the  greater  the  risk,  by  some 
definitions/  The  decision  maker  may  place  a  premium 
on  risk  reduction  and  may  be  willing  to  trade  off 
expected  performance  in  order  to  reduce  the  risk  of. 
the  system's  not  performing  adequately  In  adverse 
situations/ 

So,  to  take  stock  before  moving  on,  this  paper 
will  describe  a  computerized  method  for  representing 
uncertainty  explicitly  In  calculations  concerning 
individual  alternatives  and  for  making  choices  among 
several  uncertain  alternatives. 

Belated  Work 

I  credit  the  early  motivation  for  the  work  to  my 
research  with  M.  Granger  Morgan  at  Carnegie -Me lion 
University.6  Morgan  has  made  many  practical 
contributions  to  the  art  of  dealing  explicitly  with 
uncertainty  In  public  policy  Issues  with  high 
technology  content.  He  and  his  colleague.  Max 
Henricn,  have  developed  a  software  system  with  similar 
capabilities  which  they  call  DEMOS.  They  are  readying 
DEMOS  for  general  use  with  support  from  the  National 
Science  Foundation.  DEMOS  Is  coded  In  Pascal,  a 
higher  order  computer  language  which  Is  viewed  by  many 
as  a  root  of  the  new  Ada  language.  ^ 

In  other  related  work,  Leo  H.  Sroner  of  IBM7  has 
been  Investigating  a  number  of  computerised  methods 
for  pe*form1ng  explicitly  probabilistic 
calculations.  Some  are  similar  to  the  approach  I  have 
taken,  though  Groner  uses  a  different  representation 
of  probability  density  functions.  Much  of  his  work  Is 
also  coded  In  APL.  1 

Also,  Bonner  and  Moore  Associates  Inc., \  Houston, 
Texas8  offer  a  commercial  FQRTRAN-based  I  software 
package  for  probabilistic  calculations.!  The 
calculations  are  performed  with  Monte-Carlo  simulation 
techniques.  They  have  named  It  PAUS.  PAUS  Is 
Intended  for  a  variety  of  business  applications.  It 
also  appears  adaptable  to  engineering  applications. 

My  own  system  Is  In  an  advanced  development 
stage,  ready  for  application  testing.  It  Is  based  on 
a  numerical  approadi  which  is  similar  to  the 


/ 
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convolution  of  probability  distributions.  My 
objective  Is  to  enable  analysts  to  systematically  and 
explicitly  Incorporate  uncertainty  Into  their 
calculations  with  a  minimum  of  set-up  overhead. 

An  Illustration  of  an  Explicitly  Probabilistic 
Calculation 


A  sample  problem  solution  will  be  presented  to 
Illustrate  both  the  way  the  software  can  be  used  and 
the  extra  dimension  of  information  that  Is  available 
from  following  the  propagation  of  uncertainty  through 
a  calculation.  Consider  a  system  with  two  types  of 
elements  with  unit  operating  costs  A  and  B, 
respectively.  The  total  system  operating  cost  Is 
described  by  the  formula 

D  *  (M  x  A  +  Nx  B)c, 

where  M  Is  the  number  of  units  of  the  first  type  In 
the  system,  and  N  is  the  number  of  units  of  the  second 
type.  C  Is  an  exponent  which  represents  the  Influence 
of  scale  on  costs,  l.e.,  economy  (C  less  titan  1)  or 
diseconomy  (C  greater  than  1}  of  scale. 


Imagine  one  system  composed  of  one  unit  of  the 
first  kind  and  35  units  of  the  second  kind.  Its 
operating  cost  would  then  be: 


E  »  (A  ♦  35  x  B)c. 


A  second  system  alternative  with  no  units  of  the  first 
kind  and  75  of  the  second  ! kind  would  have  the 
following  operating  cost:  1 

F  -  (75  x  3)C. 

The  first  type  of  Item  could  very  well  be  a  large 
central  component  such  as  a  mainframe  host  computer 
and  the  second  Item  type  could  be  a  smaller  remote 
counterpart  like  a  local  processor.  Acutal  cost 
equations  would  be  more  complicated,  but  similar  In 
principle  to  the  Illustration. 


Suppose  an  analyst  had  made  (or  had  obtained)  the 
following  point  estimates:  A  *  10;  B  ■  0.24;  C  ■  0.63. 

The  cost  estimates  would  then  have  been: 


E  - 
F  • 


*  6.26,  and 
-  6.18. 


These  differ  by  1.3  percent,  i  certainly  too  close  a 
call  for  a  clear  preference.  ; 


Suppose,  further,  that  Instead  of  just  point 
estimates,  the  analyst  was  able  to  encode  the 
uncertainty  In  the  estimates  of  the  parameters  with 
the  following  probability  density  functions  (PDFs): 


A**  NORMAL  (mean  *  10;  standard  deviation  *  1) 
B~  GAMMA  ("shape*  -  2;  "scale"  -  7) 

C—  TRIANGULAR  (low  ■  0.575;  apex  « 

0.6;  high  *  0.75). 


The  probabilistic  parameters  will  yield  a 
distribution  rather  than  a  point  estimate  for 
operating  costs  E  and  F.  This  Is  how  It  Is  done  with 
the  software: 

1.  tnter  the  J>OFs  for  A,  B  and  C,  as  shown  In 
Exhibit  l.8 

2.  Enter  the  expressions  for  E  and  F,  as  shown 
In  Exhibit  2.  Where  probabilistic  variables 
are  combined,  substitute  the  functions  "PL" 
for  addition,  "MI*  for  subtraction,  "DI"  for 
division,  "TI"  for  multiplication,  and  "EX" 
for  exponentiation.  Note  that  In  an 
operation  Involving  a  constant  and  a 


probabilistic  variable,  the  regular 
mathematical  operator  may  be  used  (e.g.,  35  x 

3). 

3.  The  statistics  of  the  resulting  distributions 
are  generated  with  "DSTAT"  and  Illustrated 
with  "BOXPLOT*.  Also,  vertical  boxplots  for 
the  two  results,  E  and  F  can  be  generated 
side-by-side  (to  the  same  scale)  with  the 
function  "COMPARE*  [McNeil,  1977]  as  shown  In 
Exhibit  2. 

Referring  to  Exhibit  2,  one  sees  that  although 
the  two  alternatives  have  about  the  same  expected 
operating  cost  (as  was  predicted  by  the  original  point 
estimates  and  borne  out  by  the  medians  and  means  of 
the  probabilistic  results)  alternative  F  has  about 
twice  the  dispersion  (as  measured  by  the  standard 
deviation)  as  alternative  E.  So,  the  choice  between 
the  two  depends  on  whether  the  objective  Is  to 
minimize  uncertainty  (E  Is  better),  minimize  the 
highest  possible  cost  (E  Is  better),  minimize  the 
lowest  possible  cost  (F  Is  better),  or  some  other 
objective.  the  software  includes  a  "DECISION" 
function  which  automatically  selects  the  best 
alternative.  Its  use  Is  Illustrated  In  Exhibit  3,  In 
which  the  operating  costs  E  and  F  have  first  been 
subtracted  from  an  arbitrary  revenue  of  12  to  yield  an 
operating  profit.  This  Is  because  the  decision  rules 
are  designed  for  maximization  rather  than 
minimization. 
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ExMple  1:  Apply  Decision  Rules  to  E  and  F  Cperatlr.g  Profits 
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Exhibit  3 


To  review,  we  have  seen  that  an  analyst  can 
encode  uncertainty  directly  by  invoking  probability 
density  functions  In  assignment  statements  for  the 
probabilistic  variables.-  And  expressions  containing 
the  probabilistic  variables  are  entered  much  as 
regular  expressions  except  that  special  functions  (PL, 
HI,  TI,  01,  EX)  are  used  in  place  of  the  primitive 
operators(+,  x,  ♦,  and  exponentiation).  The 
probabilistic  results  can  be  examined  with  a  variety 
of  statistical  and  graphical  functions  to  aid  the 
decision  making,  and  several  decision  rules  can  be 
invoked  automatically  with  the  “DECISION"  function  to 
automate  the  choice  among  risky  alternatives.  In  the 
next  section,  end-to-end  transit  times  in  a  message 
processing  network  will  be  characterized  with  the  aid 
of  the  software. 
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A  Co— and  and  Control  Application:  Message 
Processing  Performance 

Robustness  In  command  and  control  (C^)  networks 
is  often  enhanced  by  redundancy  of  message  pathways, 
fhe  links  of  any  path  may  encompass  a  diversity  of 
'requencles,  transmission  media,  and  formats  (e.g. 
mice,  telemetry).  A  variety  of  processes  occur  at 
rodes  to  convert  messages  for  retransmission  where 
:ontent,  format,  transmission  medium,  frequency,  or 
ither  significant  properties  of  the  message  may 
:hange.  Exhibit  4  depicts  redundant  paths  through  a 
lypothetlcal  message  processing  network.  The 
abjective  1$  to  transmit  an  authorized  message 
(abbreviated  "MSG"  In  the  exhibit)  to  a  recipient  who 
can  determine  that  It  Is  valid  and  who  can  take  the 
actlons(s)  Indicated  by  the  message. 

The  paths,  or  subsystems,  are  Identified  as 
follows: 

o  On  Path  A,  the  message  Is  processed  for 
transmission  on  a  data  network  (Process  PI), 
Is  transmitted  (P3),  received  and  validated 
(P10); 

o  on  Path  8,  It  Is  processed  (P2),  transmitted 
(P4),  received  and  recoded  for  voice 


transmission  (P5),  transmitted  as  voice  (P6) 
with  real-time  relay  through  E,  received  and 
processed  (P9)  at  the  destination  and 
validated  (P10); 

o  on  Path  C,  the  message  Is  processed  for  voice 
transmission  (P5)  and  sent  (P7)  directly  to 
the  destination  where  It  is  received, 
processed  (P9),  and  validated  (PIO); 

o  finally,  on  path  0  It  Is  processed  for  voice 
transmission  (PS)  as  In  path  C,  but  must  pass 
through  and  Intermediate  rece  pt  and 
transmission  stage  Involving  bacx-to-back 
processing  (P6  followed  by  P8)  before  being 
received,  processed  (P9)  and  validated  (PIO) 
at  the  destination. 


All  of  the  processes  are  assumed  to  be 
statistically  Independent  of  each  other.  Nominal 
performance  statistics,  which  are  also  hypothetical 
though  not  unrealistic,  are  given  in  Exhibit  5.  I 
chose  the  Gantna  Distrlbltion  to  represent  the 
performance  statistics,  because  the  Garnna  Distribution 
Is  unlmodal  and  positively  skewed.  Is  defined  over  the 
domain  zero  to  Infinity  and  has  a  mean  and  standard 
deviation  which  are  simple  functions  of  the  parameters 
of  the  probability  density  function.10  The  parameters 
of  the  distributions  for  each  independent  process  are 
given  In  the  right-most  column  In  Exhibit  5.  Sumiary 
graphical  comparisons  of  the  Individual  processing 
times  are  shown  in  Exhibit  6. 


The  end-to-end  transit  times  for  messages  are  the 
sums  of  the  processes  along  each  path:  (PI  +  P3  + 
PIO),  for  Path  A;  (P2  +  P4  +  P5  +  P6  +  P9  +  Pll),  for 
Path  B;  (P5  +  P7  ♦  P9  >  PIO),  for  Path  C,  and  (P5  +  P6 
♦  P8  +  P9  +  PIO),  for  Path  D.  Because  each  process  Is 
treated  as  a  probabilistic  variable,  each  path  sum  Is 
found  using  the  probabilistic  summing  function,  PL, 
Instead  of  the  plus  sign.  The  assignment  statement 
which  sets  up  the  sum  for  path  0  Is  shown  In  Exhibit  7 
along  with  graphical  and  statistical  analyses  of  the 
results.  Note  the  smooth  envelope  of  the  histogram¬ 
like  stemleaf  plot  of  the  end-to-end  time.  The  plots 
for  the  other  paths  are  similar.  Summary  graphical 
comparisons  of  the  end-to-end  transit  times  are  shown 
In  Exhibit  8. 


Example  2:  Network  Message  Processing 


Exhibit  4 
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Example  2:  Nominal  Processing  Time  Statistics 
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Description  of  the  Software 

How  It  Works 


The  probability  density  function  (PDF)  of  a 
variable  which  Is  the  sum  of  two  probabilistic 
variables  is  the  mathematical  convolution  of  their 
PDFs.  Taking  a  convolution  is  like  scanning  one  PDF 
across  the  other,  exposing  every  point  in  the  domain 
of  one  probabilistic  variable  to  every  point  in  the 
domain  of  the  other  one. 


This  software  works  in  a  similar  way:  each 
probabilistic  variable  is  represented  by  a  vector  of  d 
numbers  which  constitutes  a  sample  from  the  universe 
of  numbers  with  the  indicated  PDF.  When  two  variables 
are  combined  by  addition,  subtraction,  multiplication, 
etc.,  all  possible  combinations  (additions, 
subtractions,  etc.)  of  the  elements  of  the  vectors 
representing  the  two  variables  are  formed.  This 
yields  a  table  of  dimension  N  x  N,  Exhibit  9.  Because 
the  table  Is  too  large  to  carry  around  in  the  computer 
for  further  calculations,  an  N-element  sample  is  taken 
from  the  table.  The  sample  Is  formed  by  sorting  the 

elements  of  the  table,  finding  the  median*  and 
bisecting  the  sorted  set  at  the  median.  Then  the 
lower  and  uper  halves  are  bisected  at  their  respactive 
medians;  then  each  quartile  is  bisected,  and  so  forth, 
until  a  sufficient  number  of  medians  result.  The 
number  of  fractiles  equals  2P,  where  P  Is  the  number 
of  successive  bisections.  Therefore,  the  number  of 
"'■plans  (the  boundaries  between  adjacent  fractiles)  is 


Once  two  variables  are  combined  through  N  x  N 
expansion  followed  by  median  bisection  shrinking,  the 
result  Is  available  for  combining  in  like  manner  with 
another  variable  (for  example,  in  the  sum  A  +  B  +  C,  B 
and  C  are  combined  first  —  APL  evaluates  expressions 
from  right  to  left  —  and  the  result  Is  then  combined 
with  A).  It  Is  convenient  to  keep  the  vectors  the 
same  length  throughout  this  process.  For  P  *  7,  there 
are  12Z  elements  In  the  vector.  If  M  is  chosen  to  be 
127,  tr  Is  16,129  entries.  A  table  this  size  uses  a 
lot  of  main  memory  —  over  120  kilobytes  for  8-byte 
floating  point  numbers.  (For  P  »  8,  Nz  is  over  65,536 
requiring  over  520  kilobytes  of  storage.)  Thus,  for 
practical  purposes,  the  sample  size  for  each 
probabilistic  variable  Is  127. u 

Why  bisect  by  medians?  In  part.  It  is  a 
philosophical  throwback  to  methods  for  eliciting 
subjective  probability  distributions  for  uncertain 
parameters.  The  subject  is  asked  to  give  an  estimate 
such  that  the  actual,  but  unknown,  value  is 
equlprobably  above  or  below  the  estimate.  This 
bisects  the  probability  distribution  at  the  median. 
Then  the  subject  is  told  to  assume  that  the  actual 
value  lies  in  the  lower  half  of  the  bisected 
distribution  and  Is  asked  to  estimate  a  value  such 
that  the  actual  value  is  equlprobably  above  or  below 
It.  This  estimate  bisects  the  lower  half.  The  same 
operation  is  done  with  the  upper  half,  yielding  the 
boundary  values  of  the  quartiles  of  the  subjective 
probability  distribution.  The  process  can  be  repeated 
to  yield  octlles  and  so  forth  (at  the  risk  of 
exhausting  the  subject).  Because  many  of  the  inputs 
to  probabilistic  computations  with  this  model  will  ue 
subjective  estimates  of  uncertainty  rather  than  best- 
flts  to  empirical  data,  there  is  a  consistency  in 
using  the  median  bisection  technique  to  shrink  the 
table  down  to  manageable  proportions.  Also,  because 
many  of  the  operations  are  inherently  non-linear,  it 
makes  more  sense  to  bisect  by  medians  than  by 
alternatives  such  as  means  (should  one  use  geometric 
means  for  operations  involving  multiplications,  for 
example?). 


However,  by  selecting  only  the  medians,  the 
extreme  tails  of  the  -  member  distribution  are  left 
out,  e.g.  there  will  be  125  entries  in  the  16,129  - 
element  table  that  will  be  less  (greater)  than  the 
lowest  (highest)  number  in  the  127  -  element  sample  of 
medians.  This  omission  may  be  undesirable  to  analysts 
who  focus  on  the  extreme  values  of  a  distribution. 
When  the  results  of  using  this  software  were  compared 
with  theoretical  calculations,  the  standard  deviations 
often  underestimated  the  theoretical  values  by  about  5 
or  6  percent.  This  could  be  related  to  truncating  the 
tails  during  median  bisection.  The  discrepancy  might 
be  mitigated  by  including  the  two  end  points  to  yield 
a  Zv  +  1  or  129  -  element  sample,  but  I  have  not  yet 
investigated  this  possibility. 

What  Crpabilities  It  Includes 


This  is  a  general  purpose  probabilistic 
mathematics  system.  The  functions  which  should  prove 
useful  to  general  users  are  listed  in  Exhibit  10. 
Tney  are  described  below. 

Continuous  PDFs.  These  represent  continuous 
random  variables  by  sets  of  numbers  sampled  from  the 
universe  of  appropriately  distributed  numbers.  The 
density  of  sampled  numbers  between  any  two  points  in 
the  domain  of  the  PDF  is  roughly  proportional  (within 
sampling  accuracy)  to  the  area  under  the  PDF  between 
those  two  points.  Sample  generation  starts  with 
pseudo-random  numbers.  These  are  then  transformed 
into  the  appropriate  distributions  by  a  variety  of 
algorithms  which  include  inverse  transformations  and 
acceptance-rejection  methods.  3  As  was  illustrated  in 
the  examples,  it  is  helpful  to  display  the  sample  with 
some  combination  of  a  boxplot,  stemleaf  anu 
statistical  summary  to  make  sure  that  it  fits  the 
actual  PDF  accurately  (for  example,  is  the  sample  mean 
within  a  few  standard  errors  of  the  theoietical 
mean?).  If  not,  new  samples  should  be  generated  until 
one  that  satisfies  the  user's  needs  is  obtained. 
Checks  like  these  could  be  built  into  a  production 
version  of  the  software,  but  for  now  they  must  be  done 
manually. 

Discrete  PDFs.  These  represent  discrete  random 
variables  (e.g. ,  integers)  in  a  way  similar  to  the 
continuous  PDFs  just  described. 

Primitives.  These  do  probabilistic  addition, 
subtraction,  multiplication,  division  and 
exponentiation.  They  work  by  computing  a  numerical 
analog  of  a  convolution,  as  described  above. 

Display  Functions.  These  produce  statistical 
sumraries  and  graphical  representations  of  one  or  more 
samples.  Their  use  has  been  illustrated  in  the 
examples.  Three  of  these  functions  merit  special 
mention:  "STEMLEAF",  "BOXPLOT"  and  "COMPARE",  all  of 
which  were  taken  from  [McNeil,  1977]. 

The  stemleaf  plot  appears  visually  similar  to  a 
histogram  with  a  vertical  axis,  but  it  contains  more 
information  about  the  numbers  in  the  sample. 
"STEMLEAF"  divides  the  sample  range  into  intervals  and 
prints  two  leading  significant  digits  of  each  interval 
to  the  left  side  of  the  vertical  axis.  A  digit  equal 
to  the  (rounded)  third  significant  figure  of  each 
number  in  the  sample  is  printed  to  the  right  of  the 
axis  and  aligned  with  the  two-digit  number 
representing  the  proper  interval.  Thus,  in  addition 
to  giving  a  visual  impression  of  how  the  sample  is 
distributed,  the  stem  leaf  plot  shows  the  actual 
(rounded)  numbers. 
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The  "BOXPLOT"  function  produces  an  alternative 
;ua1  Impression  of  the  sample  distribution.  Imagine 
clng  an  Invisible  mark  on  the  horizontal  or  vertical 
.  is  for  each  member  of  the  sample.  Then  enclose  the 
ige  from  the  25th  tc  the  75th  percentiles  (the 
iterquartile  distance")  In  a  box.  Denote  the  median 
3th  percentile)  by  an  asterisk  (horizontal  boxplot) 
a  hyphen  (vertical  boxplot)  placed  on  the  axis 
side  the  box.  Next,  place  t.«o  x's  on  the  axis  to 
rk  the  sample  members  furthest  from,  but  within  one 
terquartlle  distance  of  either  side  of  the  box. 
note  with  open  circles  each  sample  member  lying 
tslde  the  x's  (denote  multiple  values,  where  two  or 
re  circles  overlap  within  the  resolution  of  the 
Inter,  with  an  adjacent  numeral).  Identify  as 
tilers  those  circles  lying  more  than  1.5 
terquartlle  distances  outside  the  box  by  filling  In 
e  circles.  Significantly  skewed  distributions  are 
slly  detected  by  t  >t  lack  of  symmetry  (of  the 
dlan,  x's  and  circles)  with  respect  to  the  box. 
ghly  peaked  distributions  will  have  few  circles; 
ffuse  distributions  should  have  several.  This 
escrlptlon  for  a  boxplot  was  adapted  by  McNeil  from 
bn  Tukey's  original  version. 

•COMPARE*  prints  side-by-side  vertical  boxplots, 
1  to  the  same  scale.  The  upper  and  lower  bounds  of 
ie  total  range  of  the  samples  are  printed  at  the 
tper  left.  "COMPARE"  facilitates  a  visual  comparison 
;  several  samples  (recall  that  10  processing  time 
istrbutlons  were  compared  In  Exhibit  6)  by  depicting 
te  relative  positions  of  their  medians,  the 
imenslons  of  their  dispersions,  and  their  ranges. 

Correlation.  By  taking  all  possible  combinations 
'  variates  In  samples  In  N  x  N  expansion,  the 
•Imltlve  algorithms  destroy  any  correlation  between 
‘obablllstlc  variables.  Sometimes  It  Is  desirable  to 
ilntaln  a  dependency  among  variables  In  an 
juatlon.  "CORR*  Is  a  function  which  Is  patterned 
fter  Bonner  and  Moore  Associates'  correlation 
lgorlthm.  It  allows  the  user  to  generate  a  normally 


distributed  sample  (or  to  approximate  other  synmetrlc 
distributions)  which  Is  correlated  with  one  or  two 
already  generated  samples  (which  may,  but  need  not,  be 
correlated  with  each  other).  The  user  specifies  the 
other  probabilistic  .arlables  and  the  desired 
correlation  coefficients  as  arguments  of  the 
function.  The  correlation  coefficients  must  satisfy 
certain  constraints,  and  "CORR1'  checks  for  this. 

Once  the  correlated  sets  have  been  generated,  the 
order  of  their  elements  must  be  preserved,  and  the  N  x 
N  table  expansion  cannot  be  used.  They  can  be 
combined  In  APL  just  by  using  the  native  primitive 
operators  (  +,  -,  x,  ♦,  and  *  for  exponentiation) 
which  combine  the  N  elements  of  each  sample  on  a  one- 
on-one  basis.  The  resulting  N-element  set  will 
incorporate  the  Influence  of  the  dependency  among  Its 
variables.  It  can  then  be  combined  freely  with  other 
non-correlated  variables  using  the  regular  functions 
already  described. 

The  function  "RHO*  calculates  the  correlation 
coefficient  between  any  two  N-element  samples.  "MAT* 
displays  the  moment  matrix,  covariance  matrix,  and 
correlation  matrix  for  N-element  samples.  "RHO"  and 
"MAT"  can  be  used  to  test  for  or  to  verify 
correlations  among  variables. 

Miscellaneous  Functions.  The  six  functions  shown 
are  machine-dependent.  They  are  coded  for  IBM’s  VS 
APL  running  under  the  Conversational  Monitor  System 
(CMS).  With  Implementation-  dependent  modifications, 
however,  they  can  be  adapted  to  versions  of  APL 
runnlr.j  on  other  systems.  The  most  Important  function 
of  this  group  Is  “IS".  It  takes  the  place  of  APL's 
left-pointing  assignment  arrow,  and  It  permits  use  of 
this  software  system  from  a  non-APL  keyboard.14 
"RESET"  also  performs  an  operation  that  would  require 
a  special  APL  character,  the  right-pointing  arrow.  It 
clears  out  any  calculations  which  may  have  been 
suspended  as  well  as  functions  which  may  have  been 
pendant  (awaiting  Intermediate  results)  when  an  error 
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Is  encountered  and  calculation  stops.  And  It  also 
saves  the  cleaned-up  workspace. 

There  are  several  other  1i. eluded  functions  which 
are  called  by  the  functions  listed  In  Exhibit  10,  but 
which  are  of  no  direct  u1 .  to  the  user.  For  example 
the  placement  of  the  graphic  characters  In  the 
displays  generated  by  "BOXPLOT"  and  “COMPARE"  are 
computed  by  the  function  “FILL*. 


SOFTWARE  SUMMARY 

Probability  Density  Functions  (PDFs) 


Discrete  PDFs 

BIN  Binomial 
GEO  Geometric 
HYP  Hyper geometric 
NBI  Negative  Binomial 
POI  Poisson 


Continuous  PDFs 

BET  8eta 
CHI  Chi -squared 
EXP  Exponential 
GAM  Gamma 
LOG  Log  normal 
NOR  Normal 
RAY  Rayleigh 

(circular  normal) 
TRA  Trapezoidal 
TUI  Triangular 
UNI  Uniform 
NEI  Welbull 


Primitive  Functions 

01  Division  (“divided  by") 

EX  Exponentiation 
HI  Subtraction  (“minus") 

PL  Addition  ( “plus") 

TI  Multiplication  (“times") 

01 splay  Functions 

BOXPLOr  Graphic  representation  of  a  sample 
distribution 

CENT  Centlles  of  a  sample  of  numbers 
COMPARE  Side-by-side  comparison  of  several  boxplots 
CPLOT  Plot  of  cumulative  distribution  of  a  sample 
DECISION  Applies  decision  rules  and  Identifies  best 
alternatives 

DSTAT  Sumnary  statistics  of  a  sample 

FREQ  Unique  members  of  a  sample  and  their 
frequency  of  occurrence 
PLOT  An  X-Y  plot,  scattergram 

j SfEHEAF  A  histogram-like  representation  of  a  sample 


Other  Functions 


I  Correlation 


Forms  normally  distributed  sample  with 
prescribed  correlation  to  one  or 
two  other  samples 

Moment,  covariance  and  correlation 
matrices  among  samples 
Correlation  coefficient  between  two 
samples 


I  Miscellaneous  ( Inplementatlon-0 


ndent)  Functions 


DTG  Prints  date  and  time 

IS  Implements  assignment  operator  for 

non-APL  keyboards 

RESET  Clears  out  suspended  operations  and 
saves  the  workspace 

SKIP  Creates  blank  lines  on  the  output  display 
STATUS  Shows  CPU  use  and  workspace  status 
TIME  Times  the  execution  of  a  function 

Exhibit  10 


Concluding  Remarks 

The  motivation  for  creating  this  software  grew 
from  my  Interest  In  examining  and  quantitatively 
describing  the  Influence  of  uncertainty  on  making 
decisions  —  especially  when  an  explicit  treatment  of 
uncertainty  and  an  evaluation  of  the  resulting  risk 
reverses  the  decision  that  would  otherwise  nave  been 
made.  My  Intent  was  to  develop  a  general  purpose 
system  that  requires  little  set-up  time  to  run  a 
problem  and  In  which  the  problem  can  be  entered  in  « 
form  that  Is  natural  to  analysts. 

The  system  grew  by  pieces  as  a  spare-time 
Interest  over  the  rast  18  months.  Now,  it  Is  in  a 
develop  .lent  state  an d  ready  for  testing  and  refinement 
with  actual  applications.  I  would  welcome  the 
opportunity  to  collaborate  with  Investigators  who  seek 
a  way  of  systematically  evaluating  the  role  of 
uncertainty  In  their  quantitative  analyses. 

The  graphics  are  of  a  homespun  variety.  One 
tends  to  appreciate  their  richness  with  use,  and  to 
forgive  the  unpredictability  of  their  embedded  scaling 
algorithms.  Yet,  with  all  the  conmerclal  graphics 
software  on  the  market,  the  possibility  of  adding  some 
gloss  and  sparkle  to  the  display  capabilities  is 
enticing.  Suggestions  and  opportunities  for 
collaboration  on  the  display  features  are  therefore 
also  invited. 

Footnotes 

*  This  particular  Implementation  of  the  software  has 
been  coded  In  the  APL  programing  language  because 
APL  has  built-in  array  processing  features  that 
facilitate  the  necessary  computations.  In.  APL,  the 
assignment  syrbol  Is  the  back -arrow  —  ; 
multiplication,  is  x;  division.  Is  *,  and 
exponentiation  Is  a  single  asterisk,  *.  However, 
beyond  knowing  these  few  symbols  and  the  precedence 
relationships  for  combining  operators  In  APL,  the  user 
need  not  have  a  working  knowledge  of  APL. 

2  The  software  Includes  a  function  “CENT"  to  extract 
this  information.  Its  syntax  Is  ?  CENT  X,  where  P  Is 
a  list  containing  one  or  more  percents  of 
Interest.  The  output  contains  the  corresponding 
centlles  of  the  orobablllstlc  result  X. 

^  These  capabilities  are  also  included  In  the 
software:  a  function  which  compiles  summary 
statistics  and  several  functions  which  produce 
graphical  representations  of  results  on  the  printer 
are  Included. 

4  Another  closely  related  definition  of  risk  consiuers 
only  that  portion  of  the  dispersion  that  falls  to  one 
(undesired)  side  of  a  critical  threshold  such  as 
prof1t-lo>s  breakeven  point,  yield  strength  of 
material,  toxicity  level  of  a  dangerous  substance, 
etc. 

5  Decision  rules  which  balance  expected  outcomes  with 
risk  are  called  mean-risk  decision  functions.  The 
software  includes  a  function  which  applies  a  nunber  of 
decision  rules  (Including  mean-risk  and  game-theoretic 
rules)  to  probabilistic  alternatives  and  Indicates  the 
best  choice  for  each  rule.  It  also  Includes  graphical 
comparison  capabilities. 

6  Department  of  Engineering  and  Public  Policy, 
Carnegie -Mel  Ion  University,  5000  Forbes  Avenue, 
Pittsburgh,  PA  15213,  (412)578-2672. 

2  Leo  H.  Groner,  IBM,  Inc.,  Box  390,  Poughkeepsie, 
N.Y.  12602,  (914)463-3615. 
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8  Alan  Jenkln,  Bonner  and  Moore  Associates.  Inc.. 
2727  Allen  Parkway,  Houston,  TX  77019; 

(713)522-6800. 

8  Notes:  (1)  The  left-pointing  arrow  Is  used  for 
assignment  In  the  APL  language;  the  function  "IS"  has 
also  been  provided  for  use  on  keyboards  which  do  not 
have  the  arrow;  (2)  The  number  127  Instructs  the 
computer  to  represent  the  probabilistic  variable  by  a 
vector  of  127  variates  —  In  a  later,  more  refined 
version  of  the  software.  It  should  no  longer  be 
necessary  to  specify  the  number  of  variates;  (3)  The 
boxplot  [McNeil,  1977]  which  Is  a  representation  of 
the  distribution— the  median  Is  represented  by  the 
asterisk,  and  outliers  are  open  (near)  or  fllled-ln 
(far)  circles  —  and  the  statistical  description 
Invoked  with  the  function  "DSTAT"  [Ramsey,  1981J  are 
not  obligatory;  they  are  available  to  characterize  and 
check  the  statistical  samples  which  have  been 
generated  to  represent  the  probabilistic  variables. 

If  the  parameters  are  E  and  l,  the  mean  Is  E/L,  and 
the  variance  Is  E/Lz.  Also,  the  mode  Is  given  by  a 
simple  expression,  (E-l)/L,  which  adds  to  the 
convenience  of  using  the  Gamma  Olstlbutlon. 

^  Bullers,  J.  W.,  "Methodology  for  Calculating  the 
Expected  Performance  of  European  Theater 
Coanunlcatlons  Systems",  working  draft  dated  31  March 
1983,  The  MITRE  Corporation,  McLean,  VA  22102 

32  Even  this  manageable  number  requires  a  1  Megabyte 
virtual  machine  on  an  IBM  370  mainframe  running  under 
CMS. 

13  See,  for  example,  [Fishman,  1978]. 

14  For  example,  one  can  type  'X'  IS  '127  NOR  5  3' 
(enclosing  the  left  and  right  arguments  In  single 
quotes).  This  Is  equivalent  to  the  assignment 
statement  X-127  NOR  5  3. 
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COMMAND  DECISION  MAKERS  AND  THEIR  MODES  OF  INTERACTION 


P.  D.  Morgan 


Scicon  Ltd.,  49  Berners  St.,  London  W1P  4AQ 


Abstract .  An  analytic  model  of  the  basic  decision 
making  process  is  presented;  in  this  it  is  shown  how 
extfinal  inputs  may  be  employed  to  reduce  the  scale  of 
the  decision  making  task,  which  is  represented  in  the 
form  of  a  pair  of  coupled  decision  making  processes. 
An  approach  based  on  control  and  estimation  theory  is 
used  in  modelling  the  decision  making  process;  this  is 
supplemented  by  concepts  drawn  from  knowledge 
representation  and  from  game  tree  search.  Finally  the 
decision  maker  is  examined  as  a  part  of  a  control 
process  in  order  to  identify  the  relative  tempos  of 
different  operations  and  Interactions. 


1. 


INTRODUCTION 


Preceding  page  blank 


iv) 


terms  of  some  objective  function,  or  of  some 
goal  state; 

a  search  strategy  for  evaluating  and  selecting 
options  in  terms  of  the  current  and  goal  states. 


The  Commander  and  The  Command  System  are  subjected  to  a 
steadily  increasing  load  due  to  the  continuing  increase 
in  the  range  of  the  combat  horizon  and  the  steady 
development  of  Counter  C2  techniques.  The  solution 
generally  adopted  to  support  this  load  has  been  to 
resolve  the  Commander' c  objectives  into  ranges  of  more 
limited  objectives,  and  to  assign  these  to  subordinate 
commanders;  however  this  introduces  problems  of 
co-ordination  between  these  subordinates  (1),  which  at 
present  are  handled  entirely  manually.  The 
introduction  of  automation  into  the  Command  System  has 
made  apparent  the  need  for  a  quantitative  understanding 
of  the  command  decision  process  and  of  the  interactions 
between  decision  makers;  this  understanding  is 
required  to  provide  a  basis  for  the  analysis  and  design 
of  Command  Systems. 

In  previous  work  the  operation  of  the  Command  System 
was  represented  In  terms  of  a  basic  module;  this 
Included  the  decision  making  process  together  with  its 
means  of  interacting  with  its  environment  {2].  In  this 
approach  the  problems  of  controllability  and 
observability  [3]  were  addressed  by  means  of  the 
nesting  of  modules  to  reflect  the  hierarchical 
structure,  and  the  use  of  'Volumes  of  Interest'  and 
their  overlaps  to  determine  lateral  communications  11], 
1*1. 

A  further  conclusion  of  this  work  was  that  decision 
makers  within  the  Command  System  can  be  classified  in 
terms  of  goal-seeking  behaviour  (5],  16];  the  three 
classes  used  correspond  to:  goal  seeking  behaviour; 
multi-goal  seeking,  adaptive  behaviour;  and  ideal 
seeking,  learning  behaviour.  On  this  basis  it  is 
apparent  that  the  decision  maker  must  be  provided  with 

i)  resources  usable  to  explore  and  modify  the 
environment; 

ii)  finite  sets  of  population  and  control  models 
representing  the  contents  of  the  external 
environment  and  the  operation  of  his  resources 
within  the  environment; 

ill)  a  goal  or  goals  which  may  be  represented  in 


Examination  of  these  requirements  in  terms  of  control 
and  estimation  theory,  and  of  game  tree  search  [7] 
shows  that  a  decision  maker  with  finite  data  processing 
capacity  requires  some  method  of  truncating  the 
estimation  and  search  processes  when  handling  other 
than  trivial  tasks.  These  truncations  carry  with  them 
the  risk  of  excluding  appropriate  decisions; 
consequently  it  is  necessary  that  the  decision  maker  be 
capable  of  adapting  the  truncation  processes  in 
accordance  with  past  experience  and  the  current  state 
of  the  envitonment. 

The  model  of  the  adaptive  decision  maker  is  based  on 
the  two  stage  model  of  Boettcher  and  Levis  ]8]  and  on 
the  model  previously  discussed  (2].  The  basic  decision 
making  process  consists  of  the  determination  of  the 
state  of  the  environment  (Situation  Evaluation)  and  the 
choice  of  an  appropriate  response  (Response 
Allocation).  Adaptive  behaviour  is  obtained  by 
modelling  the  decision  maker  in  terms  of  two  parallel 
decision  processes;  an  'on-line'  decision  process 
concerned  with  assessing  the  environment  and  directing 
the  operation  of  the  resources,  and  an  'off-line' 
decision  process  concerned  with  optimising  the 
truncation  of  the  estimation  and  search  processes. 

This  model  was  developed  to  explore  the  various  modes 
of  interaction  within  a  Command  System,  and  as  an  aid 
to  determining  appropriate  man/machine  roles  within  a 
Command  System.  An  approach  based  on  control  and 
estimation  theory  was  used  as  it  facilitated 
examination  of  the  internal  and  external  interactions 
of  the  decision  maker. 


2. 


THE  BASIC  DECISION  MAKING  VROCESS 


The  decision  maker  is  envisaged  as  operating  in  an 
environment  in  which  a  variety  of  distinguishable 
phenomena  may  occur,  either  serially  or  in  parallel. 
The  rhenomena  are  represented  to  the  decision  maker  in 
the  form  of  a  time-ordered  sequence  of  noisy 
measurements,  x,  which  are  evaluated  to  determine  their 
individual  significances  by  the  Situation  Evaluation 
element;  the  output  from  Situation  Evaluation  takes 
the  form  of  a  sequence  of  estimates,  z,  of  elements  of 
the  situation  related  to  individual  phenomena,  or  to 
associated  groups  of  phenomena.  In  the  Response 
Allocation  e' oment  this  sequence  of  estimates  is  used 
to  determine  the  discrepancy  between  the  goal  state, 
Zg,  and  the  current  state,  Z^,  and  to  select 
appropriate  control  outputs,  y;  these  outputs 
correspond  to  the  allocation  of  particular  responses  to 
the  various  resources  controlled  by  the  decision  maker. 

In  this  case  it  is  assumed  that  each  phenomenon  is  a 
member  of  one  of  a  finite  but  possibly  large  set  of 
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Fig.  1  Situation  Evaluation 


populations  or  classes,  and  that  the  decision  maker  has 
access  to  a  finite  hut  possibly  large  set  of  responses. 
Each  population  has  an  associated  population  model  for 
the  generation  of  the  individual  situation  estimates, 
c,  on  the  basis  of  the  measures,  x;  similarly  each 
response  has  associated  control  models  for  the 
generation  of  the  control  outputs.  Thus  the  Situation 
Evaluation  element  is  concerned  with  the  identification 
of  the  population  from  which  the  measure,  x,  is  drawn, 
followed  by  the  application  of  the  relevant  population 
model;  and  the  Response  allocation  element  la 
concerned  with  the  evaluation  and  selection  of  a 
particular  response,  followed  by  the  application  of  the 
associated  control  model. 


P<fn<*;l*t(k))  - 


(1) 


where  p(fn<x))  is  the  prior  probability  that  an 
observation,  x,  will  be  drawn  from  the  n  population. 
p(Xj(k)lf  (x)>  Is  the  likelihood  of  a  member  of  the  n 
population  giving  rise  to  the  observation  x  (k)  ,  and 
p(Xj(k))  la  the  probability  that  x^fk)  has  its1  assigned 
value  In  X.  This  representation  relies  on  initial 
knowledge  for  the  determination  of  the  prior 
probabilities  and  the  likelihoods;  the  manner  in  which 
these  two  terms  are  handled  Impacts  on  the  adaption  of 
the  decision  making  process. 


The  tasks  of  searching  through  these  range  of  these 
populations  and  responses  will  result  In  an 
unacceptable  processing  load;  consequently  some  means 
of  truncating  the  searches  Is  required.  The  approach 
adopted  is  similar  to  the  'frame'  approach  to  knowledge 
representation  [7];  it  assumed  that  In  any  given 
period  the  populations  observable  will  be  members  of  a 
predefined  subset  of  the  set  of  all  known  populations; 
in  addition,  the  available  responses  will  form  a  subset 
of  the  set  of  possible  responses;  in  combination  this 
pair  of  subsets  corresponds  to  a  frame.  The  frame  to 
be  employed  is  defined  for  the  decision  maker  on  the 
basis  of  the  expected  situation,  the  Commander's 
objectives,  and  any  constraints  on  the  use  of  resources 
(e.g.  Rules  of  Engagement,  EMCON  plan,  etc).  Thus  the 
basic  decision  making  process  always  operates  within  an 
externally!  defined  frame  and  is  not  capable  of 
Instituting  a  transfer  to  some  alternate  frame. 

Situation  Evaluation 

I . 

The  input  to  the  Situation  Evaluation  element  from  the 
environment! Is  taken  to  consist  of  I  noisy  signals,  Xj 

where  1®I.\  The  application  of  a  sampling  plan  on  x 
provides  the!  Input  in  the  form  of  a  sequence  of  noisy 
measures,  xL(k),  which  are  Initially  taken  to 
correspond  fo  I  orthogonal  diserete-tine  dynamic 
PT0C®8£es*  tne  individual  measures,  x. (k) ,  are  vectors 
of  dimensionality rwhich  taken  their  values  from  the 
finite  alphabet  X. 

The  Situation  Evaluation  element  (figure  1)  consists  of 
two  components,  the  fitst  of  which  is  concerned  with 
identifying  the  population  from  which  x.(k)  is  drawn. 
The  probability  that  x.(k)  is  a  member  of  the 
population  is  taken  to  be: 


The  values  assigned  to  the  set  of  prior  probabilities 
reflect  the  expected  state  of  the  environment.  It  is 
assumed  that  in  any  given  period  a  large  proportion  of 
these  values  approximate  to  rero;  in  defining  a  frame 
these  values  are  set  to  xero,  restricting  the 
estimation  process  to  a  subset  of  the  populations.  The 
remaining  priors  are  assigned  Initial  values  p(fn(xo)). 
In  the  case  of  sequential  measures  on  dynamic  processes 
the  prior  probabilities  may  be  represented  as 
p(fn(x1(k))) ,  where 

pffJXjCO)))  -  p(fn(x0»  (2) 

The  evolution  of  the  prior  probabilities  with  the 
successive  measures  on  1  may  have  different  forms: 

I)  constant  prior  probabilities 

p(fn(x1(k)»  -  p(fn(xfl)>  (3) 

II)  recursive  prior  prebabi lltles 

P(fn(xi(k»)  -  p(fn(xi(k-l))|xi(k-l))  (4) 

ill)  adaptive  prior  probabilities  which  are  based  on 
the  preceding  posterior  probabilities  adjusted 
in  the  light  of  systematic  discrepancies 
identified  by  the  innovation  processes  of  the 
individual  population  models  [9]. 

The  first  two  alternatives  have  the  advantage  of 
simplicity,  but  the  disadvantage  that  any  initial 
trisestimation  of  the  priors  is  carried  through 

successive  estimates. 

associated  with  each  population  model.  It  is  assumed 
that  the  p.d.f,  of  each  model  is  defined  by  the 
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(k), 
are 

derived  from  the  previous  measurement  history. 

The  population  models,  f  (x) ,  provide  the  means  of 
generating  state  estimates?  *nj(k),  which  take  the  form 
of  vectors  of  dimensionality  s,  where 


predicted  measure,  ,(k) ,  and  variance, 
obtained  from  the  population  model,  where  they 


control  models  is  governed  by  an  externally  defined 
goal  state,  Z,.,  the  current  state  estimate,  Zg ,  and 
externally  defined  constraints  on  the  use  of  the 
control  models.  In  distinction  from  the  model  of 
Boettcher  and  Levis  (8],  the  aggregation  of  the 
individual  state  estimates,  z^,  to  form  Zq  Is  Included 
in  the  selection  process. 


s  *  r  (5) 

The  Individual  population  model  contains  a  state  model, 
which  provides  the  means  of  generating  the  state 
prediction,  ?  (k) ,  on  the  basis  of  the  previous  best 
state  estimate,  ?  ,(k-l);  in  addition  it  contains  a 
measurement  model  vrtiich  is  used  in  the  derivation  of 
£  (k)  from  “  (k)  and  x  (k) ,  and  the  generation  of 
Sn:(k)  from  S  j(k).  The  variance,  Vnj(k)  is  derived 
from  the  plant  noise  of  the  state  model  and  the 
measurement  noise  of  the  measurement  model. 

The  final  task  of  the  Situation  Evaluation  element  is 
that  of  electing  the  hypotheses  (populations  and  state 
estimates)  which  should  be  discarded.  The  objectives 
here  are  those  of  controlling  the  total  load  on  the 
decision  maker,  and  of  retaining  sufficient 
alternatives  to  permit  errors  to  be  identified  and 
corrected.  It  is  assumed  that  a  larger  number  of 
hypotheses  will  be  retained  within  Situation  Evaluation 
than  will  be  output  to  Response  Allocation,  and  that 
the  latter  set  is  included  within  the  former. 

The  selection  process  is  based  on  the  assignment  of 
costs  L  ,  (where  n,l*S)  to  the  populations,  the 
definition  of  the  costs  forming  part  of  the  definition 
of  the  frame.  The  selection  process  differs  from  that 
normally  used  in  signal  estimation  in  that  hypotheses 
are  tested  for  rejection  against  criteria  based  on  the 
total  number  of  hypotheses  extant.  The  minimum  cost 
for  any  hypothesis  is  determined  for  each  input  signal, 
and  hypotheses  are  ordered  with  respect  to  costs 
relative  to  these  minimum  costs;  hypotheses  are  then 
discarded  in  order  of  decreasing  relative  cost  until 
the  required  numbers  of  surviving  hypotheses  are 
attained. 

Response  Allocation 

The  Response  Allocation  element  (figure  Z)  contains  a 
range  of  control  models,  h  . (x),  capable  of  generating 
control  vectors,  y.(k) ,  on  the  basis  of  state 
estimates,  (k);  additionally  it  contains  the  means  of 
selecting  the  control  model  to  be  applied  to  a 
particular  state  estimate.  The  selection  of  the 


The  form  of  selection  process  employed  depends  on  the 
nature  of  the  decision  maker's  environment  and  or  his 
modes  of  interaction  with  it.  It  is  assumed  that  the 
state  evolves  stochastically  at  a  tempo  which  is  not 
subject  to  tbe  decision  maker's  control;  additionally 
it  is  assumed  that  it  is  not  generally  possible  to 
determine  the  complete  chains  of  operations  which 
transform  Z.  to  Z. ,  and  that  the  decision  maker  is 
provided  with  a  set  of  independent  resources  capable  of 
acting  on  his  environment. 

These  general  assumptions  are  extended  by  the  further, 
alternative,  sets  of  assumptions: 

i)  that  in  the  timescale  of  Interest  the  Individual 
are  independent,  and  that  each  control  model, 
tr(z),  corresponds  to  a  mode  of  operation  of  a 
single  resource  on  a  particular  population  or 
group  of  populations; 

il)  that  in  the  tlmeBcale  of  Interest  the  £  are 
best  considered  in  terms  of  Z,  and  that1  each 
control  model,  h  (z),  corresponds  to  the 
operation  of  a  number  of  resources. 

These  setB  of  assumptions  may  be  related  to  the 
different  levels  of  abstraction  in  ABSTR1PS  17],  [10]. 

In  the  selection  process  the  control  models  are 
represented  as  stochastic  operators  which  transform 
state  z  ,  (the  state  estimate  arising  from  the 
operation1  of  f  (x)  on  x.)  into  state  z  ,  with 
probability  (*_<));  1  feedback  from  thepcontrol 

models  will  be  niseo  toupdate  this  probability  during 
the  implementation  of  the  control.  The  combination  of 
this  probability  with  p(*ni) »  where 

p(*ni>  "  P(fn00|*t)  (6) 

provides  the  means  of  evaluating  the  probabilities  of 
achieving  intermediate  states,  p(Z  ),  through  the  use 
of  a  particular  range  of  processes. p  The  Z  may  then  be 
evaluated  to  determine  their  distances  frfir  Z_,  S(7.  ), 
to  provide  pruning  criteria  [7).  p 


Z.m 


Fig.  2  Response  Allocation 
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In  practise  limitations  have  tc  be  set  on  the  breadth 
and  depth  of  the  search.  It  has  already  been  mentioned 
that  the  definition  of  the  frame  Includes  the  provision 
of  constraints  on  the  selection  of  Individual  control 
models;  these  appear  as  weights  on  the  selection, 
W(z  ,p(z  )).  Alternative  approaches  using  these 
weights  are  goal-directed  game  tree  generation  using  a 
simplified  state  and  operator  representation  (10],  the 
Identification  and  use  of  appropriate  pre-planned 
strategies  [11],  or  the  use  of  Expert  System 
Techniques.  The  alternative  selected  will  depend  on 
the  natures  of  the  decision  maker's  environment  and 
task. 

The  outcome  of  this  selection  process  is  the  provision 
of  subsidiary  goal  states  to  the  Individual  control 
models.  A  standard  control  theory  approach  is  assumed 
for  the  control  models  and  the  generation  of  the 
control  vectors,  y  ,  [ 3 ] ;  each  control  model  Is  based 
on  a  single  physical  process  model. 


decision  making  process;  its  goal  Is  that  of  selecting 
the  optimum  combination  of  externally  defined  goal  and 
predefined  frame  for  Assessment  in  the  context  of  the 
observations  of  the  environment  end  of  Assessment. 

The  signal  Inputs  to  Planning  consist  of  the  measures 
of  the  environment  and  on  Situation  Evaluation  and 
Response  Allocation  in  Assessment  (e.g.  p(f  (x)|x  (k)), 
Z.,  Z  ,  p(Z  ),  S(Z  )).  The  control  Inputs  include  goal 
state!,  frames,  a?id  frame  parameters;  the  Planning 
frame,  the  related  set  of  goal  states  and  their 
associated  weights  are  expected  to  be  changed 
infrequently;  the  frame  parameters  are  expected  to  be 
changed  more  frequently,  corresponding  to  the  downward 
flow  of  Information  on  the  overall  situation  [A] .  Each 
control  model,  h  (z),  corresponds  to  a  signal 
Assessment  frame;  "hence  the  control  vectors,  y,(k), 
relate  either  to  the  alteration  of  parameters  within  a 
frame,  or  the  definition  of  a  new  frame. 


3.  THE  ADAPTIVE  DECISION  MAKER 

The  model  of  the  basic  decision  making  process  shows 
how  the  processing  load  on  the  decision  maker  may  be 
controlled  through  external  limitation  of  the  available 
options  and  the  provision  of  a  single  goal  state.  The 
extension  of  the  model  to  accommodate  self-adaptive, 
multi-goal  seeking  behaviour  is  achieved  through  the 
Interaction  of  a  pair  of  decision  making  processes 
(figure  3). 

The  'Assessment'  process  observes  the  external 
environment  and  generates  control  vectors  to  modify  the 
6tate  of  the  environment  towards  the  goal  state;  In 
carrying  out  these  activities  It  Is  constrained  by  the 
frame  defined  by  'Planning'.  The  Planning  process 
monitors  and  directs  the  operation  of  Assessment; 
direction  is  effected  by  control  of  the  Assessment 
frame;  a  possible  additional  control  mode  Is  that  of 
command  over-ride  or  veto.  The  relationship  between 
Assessment  and  Plarnlng  differs  from  that  between 
separate  Interacting  decision  makers  In  that  they  share 
a  common  input  from  the  environment,  and  that  their 
tempos  are  coupled. 

The  operation  of  Assessment  is  readily  represented  in 
terms  of  the  basic  decision  making  process  and  will  not 
be  considered  further.  Planning  also  operates  in 
accordance  with  the  basic  goal-seeking  model  of  the 


4.  THE  CONTROL  PROCESS 


The  Interaction  between  the  decision  maker  and  the 
command  system  is  represented  in  terms  of  the  Control 
Process  (figure  4).  In  this  model  'Perception' 
provides  the  data  collection,  fusion  and  filtering 
required  to  generate  x;  and  'Execution'  represents  the 
operation  of  decision  maker's  resources. 


There  is  a  considerable  body  of  work  on  the  problem  of 
signal  detection  and  estimation  [12],  [13];  a  factor 
which  is  observable  in  this  work  Is  the  asymptotic 
behaviour  of  p(f  (x)|x,(k)>  with  sample  size,  k.  This 
convergence  has  Been  addressed  by  Cramer  [14]  In  terms 
of  a  sampling  distribution  which  uses  the  least  squares 
principle  to  provide  a  measure  of  the  deviation  between 
the  observed  and  predicted  frequencies  on  the 
measurement.  In  this  analysis  St  Is  shown  that  the 
specified  sampling  distribution  tends  towards  the 
Chi-squared  distribution  as  k-a00;  it  is  also  stated 
that  the  sampling  distribution  approximates  to  the 
Chl-squared  distribution  when  the  predicted  frequencies 
In  each  sampling  region  are  >  10. 


The  operation  of  the  decision  maker  within  the  Control 
Process  may  be  Illustrated  by  the  consideration  of  the 
simple  optimal  control  problem  of  minimising  the 
objective  function 


The  operation  of  Assessment  is  readily  represented  in 
terms  of  the  basic  decision  making  process  and  vill  not 
be  considered  further.  Planning  also  operates  in 
accordance  with  the  basic  goat-seeking  model  of  the 
decision  making  process;  its  goal  is  that  of  selecting 
the  optimum  combination  of  externally  defined  goal  and 
predefined  frame  for  Assessment  in  the  cortext  of  the 
observations  of  the  environment  and  of  Assessment. 

The  signal  Inputs  to  Planning  consist  of  the  measures 
of  the  environment  and  on  Situation  Evaluation  and 
Response  Allocation  in  Assessment  (e.g.  p(f  (x)]x  (k7), 
Zpt  2  »  p(Z  ) 9  «(*»).  The  control  inputs  include  goal 
states,  frames,  ahd  frame  parameters;  the  Flanning 
frame,  the  related  set  of  goal  states  and  their 
associated  weights  are  expected  to  be  changed 
infrequently;  the  frame  parameters  are  expected  to  be 
changed  more  frequently,  ccrrespondlng  to  the  downward 
flow  of  information  on  the  overall  situation  (4],  Each 
control  model,  h  (z) ,  corresponds  to  a  signal 
Assessment  frame;  nence  the  control  vectors,  y  (k), 
relate  either  to  the  alteration  of  parameters  within  a 
frame,  or  the  definition  of  a  new  frame. 


4.  THE  CONTROL  PROCESS 

The  interaction  between  the  decision  maker  and  the 
command  system  is  represented  in  terms  of  the  Control 
Process  (figure  4).  In  this  model  ’Perception' 
provides  the  data  collection,  fusion  and  filtering 
required  co  generate  x;  and  'Execution'  represents  the 
operation  of  decision  maker's  resources. 

There  is  a  considerable  body  of  work  on  the  problem  of 
elgnal  detection  and  estimation  112] ,  [13];  a  factor 
which  Is  observable  In  this  work  Is  the  asymptotic 
behaviour  of  p(f  (x)|x  (k))  with  sample  size,  ic.  This 
convergence  has  been  addressed  by  Cramer  [14]  In  terms 
of  a  sampling  distribution  which  uses  the  least  squares 
principle  to  provide  a  measure  of  the  deviation  between 
the  observed  and  predicted  frequencies  on  the 
measurement.  In  this  analysis  it  Is  shown  that  the 
specified  sampling  distribution  tends  towards  the 
Chi-squared  distribution  as  k-moo  ;  it  is  also  stated 
that  the  sampling  distribution  approximates  to  the 
Chi-squared  distribution  wh  n  the  predicted  frequencies 
In  each  sampling  region  are  >  10. 

The  operation  of  the  decision  maker  within  the  Control 
Process  may  be  Illustrated  by  the  consideration  of  the 
simple  optimal  control  problem  of  minimising  the 
objective  function 


Rj.  4  Control  Process 


»(r.y>  -  \  Ua<t>  +  ya(t)]dt  (7) 

0 

with  alternative  process  models,  one  of  which 
accommodates  the  presence  of  a  hidden  variable;  using 
the  Fuler-Lagrarge  method,  general  solutions  for  (7) 

are  obtained  for  the  alternative  process  models,  these 
solutions  correspond  to  individual  h  (z),  Figure  5 
shows  the  evolution  of  the  state  vaFiable  resulting 
from  this  control,  the  lower  curve  is  the  optimal 

control  path  and  the  upper  curve  la  the  control  path 
resulting  from  the  effect  of  the  bidden  variable.  The 
figure  also  shows  three  basic  periodicities  in  the 
operation  of  the  Control  Process: 

i)  the  optimisation  period; 

11]  the  optimisation  cycle  corresponding  to  the 

selection  of  h  (z)  and  the  updating  of 

parameters  In  h_(^; 

n 

lii]  the  control  cycle  corresponding  to  the  tempo  of 
z,(k)  and  consequently  the  tempo  of  operation  of 

individual  h  (z), 

■ 

Application  of  these  periodicities  to  the  adaptive 
decision  maker  results  in  the  following  set  of 
relationships; 


Assessment  ! 

Planning 

Control  Cycle 

j 

Optimisation  Cycle 

Control  Cycle 

Optimisation  Pei lod 

Optimisation  Cycte 

- 

Optimisation  Period 

In  the  simplest  esse,  using  Cramer's  conclusions  for 
sampling  distributions  with  a  single  degree  of  freedom. 
It  appears  that  optimal  performance  will  be  obtained  by 
an  adaptive  decision  maker  when  the  tempo  ratio  between 
the  Assessment  control  cycle  and  the  Planning 
optimisation  period  is  of  the  order  of  1000:1.  J 

This  approach  may  be  extended  to  address  hierarchical 
and  lateral  interactions  within  a  Command  System;  the 
general  concepts  involved  have  been  discussed  in 
previous  papers,  [2]  and  [4], 

5 .  CONCLUSIONS 

The  application  of  a  control  and  estimation  theoretic 
approach  to  the  modelling  of  the  decision  making 
process  provides  a  means  of  identifying  different  mod»s 
of  Interaction  between  the  decision  making  process  and 
its  environment.  It  also  shows  how  external  knowledge 
may  be  used  limit  the  scope  of  the  decision  making 
process  In  accordance  with  some  higher  level 
appreciation  of  the  situation;  this  limits  the  range 
of  alternatives  to  be  pursued,  both  in  assessing  the 
situation  and  in  selecting  appropriate  responses,  with 
a  consequent  reduction  of  the  processing  load. 

A  case  which  is  of  particular  Interest  is  that  of  a 
decision  maker  capable  of  self-adaption  and  of  choosing 
between  alternative  objectives  in  response  to  changes 
In  the  environment.  A  model  of  an  adaptive  decision 
maker  Is  developed  from  the  model  of  the  basic  decision 
making  process  by  the  interconnection  of  two  such 
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processes;  this  provides  the  means  of  representing 
complex  decision  making  behaviour  In  terms  of 
relatively  simple  elements,  and  in  a  manner  which 
reduces  processing  requirements. 

The  incorporation  of  the  decision  maker  within  the 
Control  Process  provides  a  basis  for  analysing  the 
interactions  between  decision  makers  within  a  Command 
System  in  terms  of  tempo  of  operation  and  Its  impact  on 
decision  maker  performance.  This  has  obvious 
implications  in  the  area  of  the  development  and 
evaluation  of  Command  System  architectures. 
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‘  f  ABSTRACT 

Mode la  of  nenorv  and  information  atorage  naefnl 
in  the  modeling  and  analyala  of  decialonaaklng  with 
bounded  rationality  are  diacnaaed.  An  Infonaatlon 
theoretle  nodal  of  permanent  nenory  la  preaented  for 
deacrlblng  the  acceaelng  of  atored  Infonaatlon  by  the 
algorlthne  within  the  human  declalonnaker  nodal.  It  la 


then  applied  to  the  atndy  of  the  perfornance  - 
workload  characteristic  of  a  declalonnaker  perfornlng 


a  dnal  taak. 


1.  INTRODUCTION 


Information  theory  waa  flrat  developed  aa  an 
application  In  cornua 1c at  Ion  theory  [1].  But.  aa 
Khinchlu  (2]  ahowed.  It  la  alao  a  valid  mathematical 
theory  in  ita  own  right,  and  it  la  uaeful  for 
applicationa  In  naay  diaciplinea,  Including  the 

nodding  of  ainple  human  dec  la  ionmaking  proceaaea  [3] 
and  the  analyala  of  infomation-proceaaing  ayatena. 
Laming  (41  obaerved,  however,  that  the  human 

declalonnaker  doea  not  act  like  a  nenoryleaa 
eonnunicationa  channel,  and,  in  fact,  the  purpoee  of 
SK>at  decisionmaking  ayatena  la  quite  other  than  to 
reproduce  faithfully  at  the  output  what  waa  given  to 
the  ayaten  aa  input.  In  accordance  with  thia 

obaervation,  a  two-atage  information  theoretic  model 
of  the  deciaionnaklng  proceaa  haa  been  developed  [3], 
(61,  (71  which  includea  internal  variablea  and 

algorithna  between  the  input  and  the  output.  However, 
the  model  la  nenoryleaai  that  la.  It  la  unable  to 
racognlza  any  atatiatical  dependence  that  night  exlat 
in  the  input  or  acceaa  Internal  or  external  date 

baaea.  Thia  la  a  ainplifylng  but  very  Uniting 

aaaunption:  certainly  many  organlxatlona  receive  a 
variety  of  inputa  related  to  the  cane  aitnation,  and 
naay  of  theae  are  atatlatieally  dependent  on  one 
another.  Sen  and  Dreniek  [8]  recognized  the  need  for 
adding  nenory  to  nodela  of  deciaionnaklng  ayatena. 
They  modeled  the  human  decisionmaker  aa  an  adaptive 
channel,  i.e.,  a  channel  whose  input  nay  depend  on 
present  and  past  inputa.  With  thia  addition  of 
nenory.  they  achieved  results  which,  in  some 
experimental  situations,  reflect  obaerved  behavior. 
However,  they  have  nade  no  attempt  to  model  explicitly 
the  various  types  of  nenory  that  nay  be  found  in  a 
deciaionnaklng  ayaten. 

Several  nodela  of  memory  have  been  developed  [9]. 
Buffer  storage  allows  the  decisionmaking  ayaten  to 
proceaa  sequential  atatlatieally  dependent  inputa 
simultaneously.  Permanent  nenory  provides 
deciaionnaklng  ayatena  with  information  which  is  not 
updated  aa  a  result  of  internal  preceaaing,  while 
temporary  memory  allows  for  the  updating  of  the  stored 
information.  All  three  have  been  analyzed  (91  g 
however,  emphasis  is  placed  in  thia  paper  on  a  model 
of  permanent  nenory  and  its  use  in  the  analysis  of  a 
model  of  a  human  decisionmaker  faced  with  a  dual 
task. 


In  oomplex  situations  when  a  limited  amount  of 
tine  ia  available  for  the  decisionmaking  proceaa,  the 
declalonnaker  nay  be  better  modeled  aa  being  boundedly 
rational,  i.e.,  constrained  in  hia  abilities  to 
formulate  actions  and  foresee  consequences.  Rather 
than  always  being  able  to  make  the  optimal  decision,  a 
decisionmaker  with  hounded  rationality  may  satisfice, 
that  ia,  may  seek  to  satisfy  some  set  of  minimal 
criteria  ia  naklng  a  decision  (10). 

The  nodel  of  a  declalonnaker  with  bounded 
rationality  (5],  [6],  (7],  shown  in  Figure  1, 
conalsta  of  two  stages:  the  situation  aaaeaanent  (SA) 
and  ths  rsaponas  selection  (RS)  ones. 


Figure  1,  Model  of  deciaionnaklng  proceaa  with 
perfornance  evaluation  mechanism 

Ia  the  SA  stage,  one  of  B  algorithna  ia 
selected  via  the  variable  u  to  evaluate  the  input  and 
'hypothesize  about  ita  origin.*  The  output  of  the  SA 
stage,  z,  could  be  an  estimate  of  the  actual  signal 
given  the  obaerved  input,  or  aone  other  statistic  of 
the  input,  or  even  the  entire  input  itself.  The 
variable  z  is  then  given  to  the  response  selection 
stage  (RS),  and  one  of  V  algorithms  ia  chosen  via  the 
variable  v,  to  proceaa  the  evaluated  input  into  an 
appropriate  response.  Both  seta  of  algorithna  are 
assumed  to  be  deterministic,  so  that,  given  an  input 
x,  and  the  values  of  u  and  v,  the  output  y  may  be 
exactly  determined.  Bounded  rationality  ia  modeled  by 
requiring  that  the  total  rate  of  activity  of  the 
ayaten,  where  total  rate  of  activity  ia  a  well-defined 
information  theoretic  quantity,  be  leas  than  some 
maximum  value,  which  la  specific  to  a  given 
decisionmaking  system. 

The  performance  of  the  decisionmaker  ia  evaluated 
as  shown  in  Figure  1.  The  actual  input  x*  ia 
corrupted  by  noise,  n,  so  that  the  system  receives  x 
■x’+n,  a  noisy  version  of  the  input.  Thia  noise  could 
range  from  representing  actual  interference  with  a 
message  sent  to  the  decisionmaker  along  standard 
communications  channels,  to  representing  the 
decisionmaker’s  inability  to  observe  perfectly,  or 
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obtain  parfact  inforaxtion  partaining  to  hia 
envlronnent .  The  napping  L(x')  yields  y' ,  which  is 
defined  as  the  ideal  response  to  the  actual  input  x'l 
then  y'  is  conpared  to  the  output  of  the  system,  y. 
The  perforaance  neasure  of  the  systen  is  J,  the 
expectation  of  d(y,y')>  where  the  iattor  ia  the  cost 
of  deciding  y  when  y*  is  the  desired  response.  In  the 
context  of  this  nodal.  then,  a  satisficing 
decisionmaker  nust  choose  a  decision  strategy,  i.e., 
two  probability  distritit 'ons  on  u  and  v,  that  result 
in  J  1  J,  where  J  is  the  naxinua  cost  that  can  be 
tolerated. 


The  nodeling  ia  developed  in  the  analytic 
contc-^  of  tt-dinensional  infoimstion  theory.  There 
are  two  quantities  of  priaary  interest.  The  first  of 
theie  is  ertxopy:  given  a  variable  x,  which  it  an 
elenent  cf  ‘J:  1  alphabet  X,  and  occurs  with  probability 
p(x),  the  e^troyy  of  x,  H(x),  is  defined  to  be 


H(x)  ■ 


^  p ( x)  log  p(x) 
x 


(1.1) 


and  is  neasured  in  bits  when  the  base  of  the  logarithm 
it  two.  Entropy  is  alto  known  as  the  average 
inforaation  or  uncertainty  in  x,  where  information 
does  not  refer  to  the  content  of  the  variable  x.  bat 
rather  to  the  average  amount  by  which  knowledge  of  x 
reduces  the  uncertainty  about  it.  The  other  quantity 
of  interest  is  average  antnal  information  or 
trananiaaion:  given  two  variables  x  and  y,  elements  of 
the  alphabets  X  and  T,  end  glvjn  p(x),  p(y),  and 
p(xly)  (the  conditional  probability  of  x,  given  the 
value  of  y),  the  transmission  between  z  and  y,  T(x:y) 
ia  defined  to  be 


T(x:y)  «  H(x)  -  Hy(x) 


(1.2) 


where 


Hy(x>  ■  -  ^  log 

y  * 


(1.3) 


internal  variables,  w,  through  and  an  output 

variable,  y,  also  called  w^.  The  n.IK  states 


i-1 


^  H(w j)  -  T(x:y)  +  Ty (x:wt.wl, . .  ..w^) 

♦  T(wi:w,:...:wN_i:y) 

+  ®x<*a*ws".,*wM-a'y>  (1,S) 


and  ia  easily  derived  using  information  theoretic 
identities.  The  left-hand  aide  of  Eq.  (1.0  refers  to 
the  total  _rate  of  activity  of  the  system,  also 
designated  G.  Each  of  the  quantities  on  the  right- 
hand  tide  hat  its  own  Interpretation.  The  first  term, 
T(x:y),  it  ctlled_the  throughput  rate  of  the  system 
and  it  designated  G(,  It  measures  the  smount  by  which 
the  ontput  of  the  system  it  related  to  the  input. 

Vx:VW. . V.1  ■  *<x:V-a . v.*y) 

-  T(x:y)  (1.7) 


is  called  _the  blockage  rate  of  the  system  and 
designated  Gv.  Blockage  may  be  thought  of  as  the 
amount  of  inforaation  in  the  input  to  the  system  that 
is  not  included  in  the  output.  The  third  term, 
T(w, ;w,i . . . ;wN_t :y) ,  is  called  the  eoordlmatiom  rate 
of  the  system  and  designated  G  .  It  is  the  N- 
dimenaional  transmission  of  the  system;  i.e.,  the 
amount  by  which  ail  of  the  internal  variablas  in  the 
system  constrain  each  other.  _  The  last  term. 
ax<*»*,,s*--->,,JJ-i*y)  designated  Gn  represents  the 
uncertainty  that  remains  in  the  system  variables  when 
the  input  ia  completely  known.  This  noise  should  not 
be  construed  to  be  necessarily  undesirable  as  it  is  in 
communications  theory:  it  may  also  be  thought  of  at 
internally— generated  information,  information  supplied 
by  the  system  to  supplement  the  input  and  facilitate 
the  decisionmaking  process.  The  PLIR  may  be 
abbreviated: 


is  the  conditional  uncertainty  in  the  variable  x, 
given  full  knowledge  of  the  value  of  the  variable  y. 

McGill  (111  extended  this  basic  two-variable 
input-output  theory  to  N  dimensions  by  extending  Eq. 
(1.2): 

N 

T<w--:V  “5H(V  -  n‘w—V 


For  the  modeling  of  memory  and  of  sequential 
inputs  which  are  dependent  on  each  other,  the  nse  of 
the  entropy  rate,  B(x),  which  describes  the  average 
entropy  of  x  per  unit  time,  is  appropriate: 


H(x)  a  lim  i 
m~>® 


H[x(t),  x(t+l)  ,  a  a  a  ,x(t+m-l)  ) 

(1.5) 


Transmission  rates,  T(x:y),  are  defined  exactly  like 
transmission,  but  using  .  entropy  rates  in  the 
definition  rather  than  entropies. 


G  *  G.  +  G.  +  G  +  G 
then 


(1.5) 


The  bounded  rationality  constraint  is  expressed 
by  postulating  the  existence  of  e  naxinua  rate  of 
informat ion~processing ,  or  a  maximum  rate  of  total 
activity,  GsftZ,  «t  which  a  given  decisionmaking  systen 
can  operate  without  overloed.  Note  that  the  addition 
of  memory  to  the  decisionmaking  nodal  increases  the 
total  number  of  variables  in  the  systen  end  nay, 
therefore,  restrict  the  strategies  that  nay  be  nsed  to 
those  with  lower  activity  or  workload.  However, 
executing  a  task  with  memory  may  resclt  in  a  better 
performance  than  that  achievable  in  a  system  without 
memory. 


In  the  next  section,  s  nodal  of  memory  is 
presented.  In  the  third  section,  the  model  la  used  to 
study  the  performance  -  workload  characteristics  of  a 
decisionmaker  assigned  with  the  execution  of  two 
concurrent  tasks  —  the  dual  task  problem. 


Conant's  Partition  Law  of  Information  Rates 
(PLIR)  [12]  is  defined  for  a  system  with  N  -  1 
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2.0  PERMANENT  AND  TEMPOSAKT  MEMORY 
2.1  Introduction 

Mono  17  lo  no  tuned  to  eontitt  of  both  pornunent 
ad  temporary  ttorot  of  Information  which  nap  bo  drawn 
pon  bp  tha  algorithm  In  tha  aituatlon  aaaeasnent  end 
bo  roaponao  aolaction  atagea  during  tha 
acltionnakinf  proaaaa.  Parnanant  nanorp  la  daflnad 
ara  to  contain  valuta  which  ara  eonatanti  that  it, 
bap  nap  not  ba  ravlaad  or  appended  bp  tha  algorithne 
hut  accaaa  than.  Tenporarp  atoraga  containa  valuea 
hich  nap  ba  ravlaad  bp  tha  algor ithnai  for  example,  a 
iaerata  Ealnan  filter  algorithm  would  include 
enporarp  atoraga  of  tha  beat  eatinate  of  tha  praaent 
tata  of  tha  proceaa,  to  ba  uaad  in  tha  next  ltaration 
•f  tha  algorithm.  Tenporarp  nanorp  hat  tha  affect  of 
idding  nanorp  to  tha  algoritbaa  thomaelveai  with 
:  enporarp  nanorp  available,  tha  algorithna  can 
rananbar  valuta  from  one  iteration  to  tha  text.  Tha 
liviaion  of  nanorp  into  parnanant  and  tenporarp  heart 
a  atrong  ratenblanca  to  tha  divition  of  nanorp  that  it 
aada  in  tha  cognitive  aciencaa,  into  long-term  and 
abort-tarn  nanorp  [13]. 

A  third  tppa  of  nanorp,  called  aanaorp  nanorp,  ia 
alao  hppotheaixed  bp  psychologists.  Information  from 
tha  environment  ia  atorad  in  aanaorp  nanorp  before  it 
uadergoea  anp  proceaa ingj  aanaorp  nanorp  night 
therefore  ba  compared  to  a  buffer  atoraga  nodal.  Tha 
latter  allova  tha  ainultaaaout  proceatlng  of 
aequential  atatlatlcallp  dependant  inputa.  Several 
different  nodela  have  bean  developed  [9]  that  depend 
on  tha  claaa  of  inputa  that  tha  apatan  recaivee.  Shift 
regieter  buffara  provide  the  atoraga  rule  nacaaaa rp  to 
proceaa  input  from  a  general  Markov  aourca.  Fixed- 
length  atring  buffara  ara  a  auitabla  nodal  for  tha 
tppa  of  atoraga  found  in  nachinaa.  Variabla-iangth 
atring  buffara  ara  appropriate  nodela  for  aona  tppaa 
of  human  aanaorp  nanorp.  Shift  ragiatar  buffara  ara 
einple,  but  add  a  great  deal  of  aotivitp  to  tha  apatan 
and  reault  la  redundant  proaaaaiag.  Fixed-length 
bnffara  do  not  anffar  from  thaaa  deficianciaa,  but 
introduce  a  aubataatial  delap  which  ia  proportional  to 
the  length  of  tha  atring.  Variable-length  atring 
buffara  have  analler  average  dalap  than  fixed-length 
oaaa,  bat  inornate  tha  overall  activity  bacauaa  of 
their  relative  eonplexitp. 

Tha  nodal  of  parnanant  nanorp  praaantad  ia  thin 
paper  la  alailar  to  long-tarn  nanorp,  la  that 
information  la  atorad  indafinitalp  and  ia  accaaaibla 
bp  information  proceatlng  aechaaitaa.  It  ia  diffaraat 
in  that  new  information  ia  being  added  continucualp  to 
long-term  nanorp;  tha  permanent  nanorp  nodal  ia  thia 
paper  provider  no  nechaniau  ,  for  thia  addition. 
Second,  information  nap  be  loat  Iron  long-term  nanorp; 
thia  permanent  nanorp  nodal  doaa  not  have  a  forgetting 
aeohaaian.  Thaaa  diffarancaa  are  noted  to  indicate 
that,  although  ainilaritiea  exiat  between  the  nodal  of 
nanorp  praaantad  hare  and  that  found  ia  tha  cognitive 
aciencaa,  parnanant  nanorp  ia  not  intended  to  ba  a 
nodal  of  long-tarn  nanorp  par  aa. 

Permanent  nanorp  nap  ba  accaaaad  bp  both  tha 
aituatlon  aaaaaanant  and  tha  reaponaa  aelaction 
atagaa.  However,  in  thia  paper,  conaideration  will  ba 
United  to  tha  aituatlon  aaaaaanant  ataga.  Tha 
ralationahlpa  derived  ara  tha  tana  aa  they  would  ba 
for  tha  reaponaa  aelaction  ataga.  aince  tha  two  halvat 
of  tha  dacialonnaking  proceaa  ara  etructurallp 
identical. 

It  ia  quite  poetible  that  a  dacialonnaking  apatan 
nap  contain  both  buffer  atoraga  and  permanent  ;;i 
tenporarp  unite.  However,  ia  ordar  to  ainplifp  tha 


praaantation,  tha  aaaunption  ia  nada  that  tha 
dacisionuking  apatan  containa  no  buffer  atoraga. 
Thia  aituatlon  ia  relaxed  aaailp  [9]. 

Tha  general  nanorp  unit  applicable  to  both 
parnanant  and  tenporarp  nenorp,  ia  ahown  in  Figure  2. 
It  conaiat  of  M  varlablea,  da  through  dj.,  aa  well  aa 
an  input  N-vector.  Dj,  and  an  output  H-vector  D «. 
Note  that  bacauaa  parnanant  nenorp  nap  not  ba  revised, 
ita  nodal  will  not  contain  tha  input  vector  Dj. 


Figure  2.  Model  of  SA  aubapatan  with  nanorp 


Parnanant  Menorp 


It  night  teen  at  flrat  that  tha  addition  of 
parnanant  nanorp  to  a  dacialonnaking  apatan  night  have 
not  affect  at  all  on  tha  total  information  theoretic 
rata  of  activitp  of  tha  apateni  if  the  value?  of  d^ 
for  k-l,2,...,M  do  not  change  over  tine,  then 


S<dk)  -  0  k  -  1.2.. ...M  (2.1) 


Since  total  aotivitp  it  juat  tha  tun  of  tha  entropiee 
of  tha  individual  variablet  in  the  apatan,  it  appaara 
that  tha  addition  of  M  detarainletic  varlablea  to  a 
•paten  ahould  have  no  effect  on  ita  total  activitp. 
Hovavar.  the  problem  ia  actually  more  complex.  In 
ordar  to  denonatrate  tha  tppaa  of  ehangaa  that  occur 
whan  parnanant  nanorp  la  added  to  tha  nodal,  a 
particularly  einple  example  will  ba  analyzed. 


Let  the  permanent  nenorp  unit  conaiat  of  ona 
variable,  da,  which  nay  be  acceaaed  by  ona  SA 
algorithm,  f,,  at  ahown  in  Figure  3. 


1SIIUATICN_ASSESSMENT  SUBSYSTEM 


Figure  3.  Example  of  SA  aubapatan 
with  parnanant  nanorp 

Algorithm  f,  provider  tha  average  value  of  tha  tvo 
component  •  of  a  vector  input.  Whenever  a  apacific 
algorithm  ia  accaaaad  by  the  deciaionnaking  apatan, 
tha  vari  slat  of  that  algorithm  are  defined  to  ba 
active,  and  taka  valuea  according  to  tone  probability 
distribution  which  ia  a  function  of  tha  input.  Whan 
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that  algorithm  it  sot  accessed,  ita  variables  art 
defined  to  ba  inactive;  l.e.  they  assume  aoaa  fixed 
value,  aay  #,  which  they  nav  not  aaaua  when  they  are 
active. 


Nov  consider  algorithm  ft,  which  provides  the 
aean  valve  of  the  two  components  of  a  vector  inpat. 
There  are  two  similai  ways  of  implementing  this 
algorithm.  The  first  does  not  access  permanent 
memory,  although  there  la  sue  implicit  memory  in  the 
algorithm  itself: 


*»  “  2  *  *s  “  *s  +  xs 

w,  -  »»/*!  t  x  «  w,  (2.2) 

The  second  does  access  permanent  memory: 

dj  •  2  s 

defined  ontside  the  algorithm  ft 
*s  “  <*1  *  *s  “  *z  +  *s 

w,  ■  Vj/vj  t  i  ■  w,  (2.3) 


In  the  second  exuple,  the  variables  *!•**•*» 
are  inactive  when  the  algorithm  is  not  accessed.  In 
the  first  example,  however,  w9  most  retain  the  valve 
of  2  throughout,  since  no  means  have  been  provided  to 
reinitialize  its  value  each  time  the  algorithm  it 
accessed.  It  is  now  possible  to  compare  the  levels  of 
activity  of  the  system  with  the  permanent  memory  unit 
and  that  without.  First  consider  realization  A.  The 
throughput,  blockage,  and  noise  rates  of  the  SA 
subsystem  are  given  by  (91: 


Gt-  H(s)  -  IMx) 


6b  “  Vi:  **' 


'on' 


H(u) 


(2.4) 


With  inputs  arriving  once  every  second,  the 
coordination  rate  is  found  as  follows: 


G 

c 


J  J  ®(wji)  +  "<b)  +  "<l)  "  “(u.W.z) 
i-1  j-1 


(2.3) 


Here,  w,*  represents  the  j-th  variable  of  algorithm  i> 
and  T  represents  the  entire  set  of  Wj  in  the  SA 
subsystem.  Finally.  Oj  is  the  number  of  internal 
variables  of  algorithm  i  which  arc  active  or  inactive 
according  to  the  value  of  u.  Equation  (2.3)  reduces 
to: 

0 

G  «  ^  a,  H[p(u)«i>] 


5  H  (w*)  -  B  (f)  +  D(z)  (2.6) 

L  Q  J  O 


The  symbol  I  denotes  the  binary  entropy  of  its 
arguent,  given  by: 


i-1 

0 

i-i 


■(p)  «  -  p  log  p  -  (l-p)log  (1-p),  0  i  p  i  1  , 

(2.“) 


The  sue  quantities  may  bo  calculated  for  realization 
B.  The  rates  of  throughput,  blockage  and  noise  are  not 
effected  by  the  small  structural  difference  in 
algorithm  flB»  the  rate  of  coordination  does  change. 
Consider  Eq.  (2.0):  ulB  is  now  equal  to  3,  because  wx 
is  now  active  when  u-1  and  inactive  otherwise. 
Therefore,  the  first  term  of  Eq.  (2.6)  is  increased  by 
the  uount  l[p(u-l)].  The  second  and  third  terms 
remain  the  same,  even  though  there  is  now  some 
uncertainty  associated  with  the  value  of  w1.  Knowledge 
of  the  value  of  u  resolves  that  uncertainty,  so  that 

£  (w.)  -  0  (2.8) 
B  * 


Similarly,  i9(V)  is  unchanged.  Only  the  structure  of 
the  algorithm  has  been  changed,  so  the  output  remains 
the  sue,  and  the  last  term  of  Eq.  (2.0  is  unchanged. 
Therefore,  the  addition  of  one  unit  of  permanent 
memory  to  the  SA  subsystem  provides  a  total  increase 
in  activity  of  Kp(u-l)).  In  general.  If  algorithm  1 
directly  accesses  pj  values  from  permanent  memory,  and 
no  other  changes  are  made  la  the  algorithms,  then  the 
incremental  activity  of  the  system,  AG,  is  given  by 

U 

AG  «  H(p(u-i)]  (2.9) 

i-1 


3.0  THE  DUAL-TASK  PROBLEM 
3.1  Introduction 


It  has  been  observed  that  if  a  person  must 
execute  two  tasks  by  switching  between  them,  his  level 
of  performance  may  be  different  than  when  he  is 
allowed  to  confine  himself  to  one  task  [13J ,  [14], 
even  if  the  arrival  rate  for  Individual  tasks  is  the 
tame  for  both  cases.  If  there  is  sons  synergy  between 
the  two  tasks— that  is,  if  the  two  tasks  are  related 
and  executing  one  actually  helps  the  execution  of  the 
other — then  performance  may  improve.  If,  on  the  other 
hand,  the  two  tasks  are  dissimilar  or  simply  do  not 
reinforce  each  other,  performance  may  decline  from 
what  it  was  in  the  single-task  case.  It  is  this 
latter  phenomenon  that  will  be  explored  in  this 
section. 

There  are  numerous  possible  ways  in  which  the 
dual-task  problem  might  he  modeled.  For  example,  if 
the  two  tasks  to  be  performed  are  assumed  to  be  so 
different  from  each  other  that  they  demand  different 
sets  of  algorithms,  then  a  pre-processor  may  be 
required  for  the  system.  The  pre-processc r  determines 
which  type  of  task  each  input  represents  and  then 
allows  access  to  a  set  of  decisionmaking  algorithms 
appropriste  to  that  task.  Of  course,  the  activity  of 
the  pre-processor  increases  overall  system  activity 
and  may,  therefore,  lover  performance.  On  the  other 
hand,  if  the  two  tasks  to  be  performed  are  assumed  to 
be  similar  but  non-synergistic,  they  may  be  able  to 
use  the  same  basic  sets  of  algorithms,  as  lorg  as 
these  algorithms  are  adaptable  to  each  task  through 
two  different  sets  of  psrameter  values  stored  in 
permanent  memory.  Notice  that  there  is ‘an  implicit 
need  for  a  pre-processor  in  this  problem,  since  the 
algorithms  must  have  some  way  of  knowing  which  type  of 
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t  h»i  been  reoeivad  in  order  to  determine  which 
of  uImi  etored  la  memory  to  aeceae.  Or* r«H 
wit y  It  laerottod  la  this  formulation  at  wall  by 

neeaaalty  of  «w itching  batwaaa  aata  of 
nation.  Aa  example  of  thia  aacond  problem  ia  a 
1  awitehboard  operator  who  ha a  to  proceaa  both 
•lag  and  oot going  call*;  although  tha  taak* 
.lea  tha  aaaa  baaie  aotloa.  they  differ  with 
.act  to  tha  information  required  to  axacota  tha 
:*.  Tha  aaooad  problow  la  addraaaad  la  thia  paper; 
flrat  one  will  ba  praaantad  at  a  latar  tiaa. 

Ia  ordar  to  aimplify  thia  problaa,  aawaral 
/  inptioaa  will  ba  aada.  Flrat,  to  circumvent  tha 
I  for  a  pre-proceeaor,  it  ia  aaaoaed  that  thara  ara 
aaparata  iapata  to  tha  ayataa,  *A  and  Xg,  which 
aaabara  of  diajoint  alphabet  a,  X.  aad  Xg.  Only 
of  thaaa  iapata  ia  aotiwa  at  aay  given  tiaa:  if  xA 
active,  tael  A  aaat  ba  parforaad,  aad  if  Xg  ia 
iwa.  tack  B  aaat  ba  parfonad.  Iapata  arrive  at 
ayataa  once  every  aacond.  and  thara  la  a  known 
bability  ID  (rapraaanting  tha  taak  diviaion)  that 
will  ba  active  at  any  given  tiaa. 

If  iapata  ara  not  xynargiatio.  than  they  ara 
aaad  to  ba  atatiatically  independent.  It  x  ia 
orated  independently  every  x  aaconda,  tier 

l(x)  -  |  B(x)  (3.1) 


[_D£C  I S10NM  AKING  _S_YSTEM _ j 

Fignra  4.  Model  of  daciaioaaakar  with  peraanant 
aaaory  for  Dual-Taak  problaa 


3 .2  Iaforaatloa  Tteorettc  Aaalyaia 

Ia  ordar  to  naaaare  if  a  change  ia  perfonance 
level  occara  between  tha  aingle-teak  aad  daal-taak 
aitaationa,  a  performance  index  la  repaired.  Tha 
index  3  that  ia  aaad  ia  tha  probability  of  error.  Ia 
tana  of  tha  quantifier  defined  in  xaction  1  and 
depleted  in  Figaro  1  (with  tha  oatpat  of  tha  ayataa 
now  equal  to  a). 


d(a.x’) 


1 

0 


if  x  *  x' 
if  x  -  x' 


(3.3) 


r». fore,  in  tha  reaalta  which  follow,  entropioa 
.her  than  entropy  ratea  will  ba  aaad.  Activltiea  ara 
lotad  by  0  in  place  of  G  for  activity  ratea.  Tha 
.ta  for  G  are  bltt  par  aymbol  (aa  oppoaad  to  bita 
•  aacond  for  G) .  Note  that  for  tha  probleai  with 
tergy  between  taaka,  tha  aaaanption  of  dependence 
:ween  aaquential  iapata  woald  ba  appropriate;  a 
{far  atoraga  aodel  woald  ba  added  and  activity  ratea 
aid  ba  aaad  in  tha  aaalyaia. 

The  baaio  wodel  for  tha  probleai  la  ahowa  in 
|arc  4.  Tha  variable  a,  which  acta  independently  of 
a  inpat  x,  ooatrola  which  of  two  aitaation 
aaaaaant  (SA)  algorithm*,  fj  and  f2,  will  be 
caaaad.  Tha  dacialoa  atrategy  for  a  ayataa  each  aa 
ia  aay  than  ba  defined  by  tha  probability  &  that  a 
aqua  1  to  1.  For  aimplicity.  it  ia  aaaaaad  that 
a  parpoaa  of  both  taaka  la  merely  to  aaaaaa  tha 
taation,  ao  that  x  ia  tha  oatpat  (no  BS  ataga) . 
a  variablaa  *2  and  ^  *re  rapraaented  aa  awitchaa 
tarnal  to  tha  algorithm*  only  ao  that  their  fanotion 
y  ba  highlighted.  Figara  4  doea  not  explicitly 
pint  tha  n«  chan  law  by  which  *2  and  *2  take  thalr 
laaa,  bat  only  that  they  ara  dependant  on  the  valaa 
‘  a  aad  on  tha  valaaa  of  xA  and  Xg. 

■eeifioally,  thay  taka  valaaa  aa  followa: 

A  if  a  •  i,  x^d  #,  Xg-  f 

t  “  *  if  a  -  i.  xA-  d.  Xgd  f  i  -  1.2 

t  it  a  d  i  (3.2) 


a  addition  to  *2  and  *2,  algorlthaa  fj  and  f2  contain 
aad  a2  internal  variablaa.  Finally,  DA  and  Dg  ara 
a  two  aet*  of  information  or  data  needed  by  the 
lgorithaa  to  proceaa  inpnt  froa  XA  and  Xg, 
aapaotivaly.  It  la  aaanned  that  both  algorlthaa  naa 
11  of  tha  information  la  DA  whan  performing  taak  A 
ad  all  of  Dg  for  taak  B. 


and  tharafora  J,  tha  expectation  of  d(x.x')  ia 

J  -  ^  p(x)d(x,x')  ■  prob(xdx')  (3.4) 


Baceuaa  two  diatinct  taaka  ara  being  performed,  JA  ia 
defined  aa  the  probability  of  error  In  exaenting  a 
type  A  taak;  and  Jg  aa  tha  probability  of  error  ia  a 
type  B  taak.  Mora  preeieely, 

*  prob(x-x'lx  a  X^)  i  *  A.B  (3.5) 


Note  that  thaaa  quant it iaa  ara  independent  of  tha 
taak  diviaion  ID,  the  probability  that  x  a  XA,  bat 
will  be  dependent  in  general  on  the  daciaion  atrategy 
A.  In  fact,  if  it  ia  known  how  tha  eyatam  parforma 
whan  pare  atrategle*  ara  employed  (either  a  ia  1  with 
probability  1,  aad  algorithm  fj  ia  alwaya  need,  or  a 
it  2  and  algorithm  f2  ia  alwaya  aaad),  the  performance 
of  tha  ayataa  under  tha  mixed  atrategy  A  (algorithm  fj 
ia  aaad  with  probability  A)  ia  aiaply  a  oonvax 
combination  of  the  perforaancea  ating  pare  etrateglea 
15).  i.a.. 

3^5)  -  A(J1lnml)  ♦  (1-A)  (jJu-2) 
i  -  A,Bj  0  <  A  <  1  (3.6) 

With  tLia  definition  of  taak  performance,  it  ia  alto 
poaaiblt  to  define  an  overall  performance  index  for 
tha  ayatea: 

1(A)  -  Cn»JA(A)  +  (1-T»)JB(6)  (3.7) 


If  error*  on  one  taak  ara  more  detrinential  than 
arrora  on  tha  other,  than  weighting  coafficianta  may 
ba  introduced  on  tha  right-hand  aide  of  Eq.  (3.7). 
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Ths  utlrltj  of  ths  lyitu  will  ciii|«  both  ae  i 
fraction  of  tho  deoi.lo:  itnt(|y  6,  rad  as  a  function 
of  tha  task  division  TD.  In  fact,  G,  the  total 
activity  of  tha  system,  is  convax  both  in  6  (with  a 
fixed  TD)  and  in  TD  (with  6  fixed) .  Tha  convexity  of 
B  in  6  ha  a  already  bean  shown  IS] i  tha  convexity 
of  0  in  TD  will  be  demonstrated. 

As  suae  that  only  task  A  is  being  performed.  Nota 
that  under  this  asanaption,  tha  need  for  variables  aa 
tad  a,  disappears)  tha  algorithas  may  be  directly 
connected  to  data  baas  DA.  In  this  case,  the  levels 
of  activity  for  a  decialonaaker  with  two  SA  algorithas 
f,  and  fa,  containing  ua  and  oa  internal  variables, 
respectively,  and  a  decision  strategy  6,  are  given  by: 


Gt  ♦  -  I(x) 

Gn  +  H(u)  «  1(6)  2 

G  «  (a  ♦  a  )*(#)  +  H(x)  ♦  5  p(u-i)g*  (3.8) 

C  *  *  Mm  0 

1-1 


the  quantity  d  is  the  entropy  of  a  binary  variable) 
Ig1  rad  gc*  are  defined  to  be  the  internal 
coordinations  of  algoritha:  f,  and  fa,  respectively, 
there  internal  coordination  is  defined  as 


1  .  1 
«c  "  prob(n-i) 


[  ^  H(wj|u*i)  -  H(wMu-i)J 
j-1  (3.9) 


snd  I*  represents  the  eat  of  all  of  the  variables  of 
ilgorithsi  i.  The  total  activity  of  the  system  is  then 
tha  su  of  the  quantities  given  in  Eq.  (3.8): 


G  -  H(x)  ♦  H(i)  +  lax*  at  ♦  1)H(6) 

2 

♦  J  p(u-i)g*  (3.10) 

i-1 

Iota  that  all  of  the  above  quantities  are  conditional 
in  task  A  being  performed:  e.g.,  gei  could  be  written 
(geHxtXA).  It  has  been  shown  [3]  that  G  is  convex  in 
the  decision  strategy,  i.e.: 

C(8)  >  (6) (Glu*l)  +  (1-6) (Glu*2)  (3.11) 


Figure  3.  ^presentation  of  G  vt.  J  for  binary 
variation  of  pare  strategist 

Let  d^  represent  a  tingle  variable  of  tha  permanent 
memory  nnit,  let  D  represent  all  the  permanent  memory 
(both  0A  and  Dj|),  and  let  V  represent  all  of  the 
internal  variables  of  both  algorithms  f,  and  fat  then 
tha  coordination  of  the  system  performing  two  tasks  is 
given  by: 

2  «1 

Ge  “  5  5  H<wj)  +  H(,s)  +  «<‘.»  *  *<»>  +  H(l) 

i-1  j-1 

♦  5  H(d  >  "  H(l,e  ,s  ,u,x,D)  (3.13) 

!  k 

•  i 

After  mnch  nanipnlation,  Eq,  (3.13)  may  be  reduced  to 

i  «  “  («  +o  +2)*(8) 

o  »  * 

2 

♦  J  p(u-l)[(TD)(g*lxsXA)  ♦  (1-TD) (gMxaXg)  1 

i  w 

*  ai 

♦  J  5  T(»j:a4l»-i)  +  ■(*)  (3.14) 

i-1  J-1 

i 

and  the  total  activity  for  the  system  performing  two 
tasks  is  giv?r-  by: 


Therefore,  using  Eqs.  (3.6)  and  (3.11),  G  may  be  found 
parametrically  as  a  function  of  J  for  the  single-task 
problem,  as  shown  in  Figure  3. 

The  dual-task  problem  requires  the  varisbles  st 
■nd  sa  to  be  inclnded  la  the  model.  It  is  still  the 
:ase  that 

Gt  +  Gfc  -  H(x) 

Gb  -  H(u)  -  1(6)  (3.12) 


G  -  H(x)  +  H(i?  ♦  («a+ua+3)E(6) 

2 

♦  }  p(u-i)  [(TD)(g*|xsXA)  ♦  (1-TD)  (gMxsXj)] 
i-1 

2  Bi 

+  5  ^  T<wj:,ilu”i)  (3.15) 

i-1  J-1 

There  are  now  two  additional  H(6)  terms)  these  ere  due 
to  the  presence  of  the  two  additional  system 
variables,  sa  and  sa.  The  internal  coordination  term 
is  now  a  convex  combination  of  the  internal 
coordinations  fonnd  when  only  task  A  or  B  is 
performed.  Finally,  the  last  term  of  Eq.  (3.13), 
which  does  not  even  appear  in  Eq.  (3.10): 
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(3.10 


RHS  -  t(TD)H*(x)  +  (1-TD)HB(i)] 


T(e*:s.lu-i)  *  H(s  !u-i)  -  H  i(»  lu»i) 
j  i  l  m  l 


Thli  nay  be  Interpreted  as  the  amount  of  Information 
transmitted  between  tj  and  w,x,  given  that  algorithm  i 
is  being  need;  i.e.,  it  is  the  extent  to  which 
variable  tr.1  reflects  which  task  is  being  performed. 
Since 

H(Sjlu-i)  -  p(u-i)H(TD)  (3.17) 


then 

0  <  T'v^SjIn-i)  <  p(u=l>H(TD)  (3.18) 


It  will  now  be  shown  that  for  a  fixed  valne  of  6, 
0  £  6  1  1,  G  it  convex  in  the  task  division.  i,c., 

G(TD)  >  (TD)(GlxeX.)  +  (1-TD) (GtxaX-)  0  <  TD  <  1 

(3.19) 


The  right-hand  side  (RHS)  of  (3.19)  may  be  foond  nsing 
(3.10): 

RHS  -  (TDHH^x)  +  HA(x)  +  a%*at +1)1(6) 

2 

+  ^  plu-iHgMxeX^)] 
l-l 

+  (l-TD)(HB(x)  ♦  HB(x)  +  •1+aj+l)H(6) 

2 

+  ]>  P<B-0(g‘lxi:XB))  (3.20) 

1-1 

Here.  H-J(x)  and  H^(x)  are  the  entropies  of  x  and  z 
which  occur  when  only  a  single  task  is  executed.  The 
probability  distributions  fo  x  and  z  in  the  dual-task 
case  are  a  convex  combination  of  those  for  the  single¬ 
task  cases. 

p(x)  *  (TD)p(x|xeXA)  +  (l-TD)p(x|xeXB> 

0  <  TD  <  1 

p(x>  -  (TD)p(x|xeZ  )  +  (l-TD)p(xlxeZD) 

A  B  (3.21) 


Vh<  n  a  probability  distribution  is  the  convex 
coobination  of  two  others,  as  in  Eq.  (3.21),  then 
(161: 

H(x)  2  (TD)!!*!!)  +  (1-TD)EB(x) 

0  <  TD  i  1 

B(t)  l  <TD)HA( x>  ♦  (1-TO)HB(i)  (3.22) 


+  KlTDB^x)  +  (l-TD^U)] 

+  [(«j+at+l )■(»)] 

2 

+  ]>  p(«-i)ITB(*‘l*«XA)  +  (1-TD)  (gMxeXp)J 
1-1  (3.23) 


Now  compare  Eq.  (3.23)  to  Eq.  (3.13).  nsing  the 
results  of  Eq.  (3.22).  th.  faot  that  1(8)  1  0.  and  the 
fact  that  transmissions  most  also  be  non-negative.  It 
follows  that  Eq.  (3.19)  does  indeed  hold,  and  H  it 
convex  in  the  task  division.  In  fact,  if  a  mixed 
strategy  8  is  being  need  (0  <  &  <  1),  or  if  any  of  the 
internal  variables  of  an  algorithm  in  nse  reflects 
which  task  is  being  performed,  i.e., 

TfwJiSjIu-i)  >  0  (3.24) 
then  the  inequality  of  Eq.  (3.19)  will  be  strict. 


3.3  Effect  of  Task  Division  on  Performance 

To  see  the  effects  that  this  result  has  on 
performance,  consider  a  particularly  simple  example. 
It  is  essuaed  that  the  aingle-task  ectivity  or 
workload  versus  peformance  curves  are  identical  for 
task  A  and  task  B  (this  implies  that  I.  and  Jg  are  the 
same  functions  of  8:  see  Figure  6a).  Now  consider  the 
evolution  of  the  G  versus  3^  curve  aa  TD  changes  from 
0  to  1.  It  is  meaningless  to  define  J  ^  for  the 
single-task  case  in  which  task  B  is  always  performed 
(TD  -  0),  but  for  very  small  values  of  TD,  is 
dei load  as  in  Eq.  (3.3).  To  find  the  G  versus 
curve  for  TD  s  0,  consider  Eq.  (3.13).  Since  l(TD)  s 
0  for  TD  s  0,  its  last  term  is  small  (see  Eq.  (3.18)). 
The  rest  of  Eq.  (3.13)  reduces  to  Eq.  (3.23): 


G(TDSO)  s  (GlxsXg)  ♦  2«(6)  (3.23) 


lu  other  words,  the  G  versus  curve  will  be  the  same 
as  either  single-task  curve,  with  the  quantity  211(8) 
added  on  due  to  the  pretence  of  variables  t*  and  a, 
(see  Figure  6b).  As  TD  increases,  G  will  continue  to 
increase  up  to  some  point  (because  of  its  convexity  in 
TD),  dependent  on  the  value  of  the  last  term  of  Eq. 
(3.13)  and  the  values  of  B(x)  and  B(x).  For  TD  equal 
to  0.3,  the  G  versus  J .  curve  will  have  the  general 
shape  shown  in  Figure  6c.  Finally,  G  will  decrease 
until  It  t  1  and  G  versus  is  again  at  shown  in 
Figure  6b.  For  a  fixed  value  of  8  then,  say  8-0.2, 
the  workload  versus  task  division  curve  will  be 
similar  to  that  shown  in  Figure  6d.  The  maximum 
activity  need  not  occur  at  TD  -  0.3. 


snd  it  follows  that  Ee  (3.20)  can  be  written  at: 


4.0  CONCLUSION 


Fiior*  6.  Perforatnc*  vt.  Workload  tod  Talk  Division 

Now  consider  what  happens  to  performance  if  the 
aaxiaua  total  workload  constraint  ia  givon  by  the 
value  narked  C  in  Fi|are  6,  i .*,,  the  syatea  is 
required  to  perform  at  an  activity  level  G  1  0MJ.  Ia 
the  two  single-task  cases,  the  syatea  is  unconstrained 
and  aay  use  any  strategy  0  1  6  1  1.  However,  for  this 
exaaple.  when  both  tasks  arrive  with  equal  probability 
(TD  -  0.5),  the  tat  of  feasible  strategies  is  greatly 
reduced,  and  performance  is  Halted  to  being  very 
poor.  Also,  the  particular  strategy  of  A  «  0.2  aay 
only  be  used  for  task  divisions  close  to  0  or  1  (see 
Figure  dd). 

This  sinpl*  exaaple  illustrate*  sob*  rather 
gen.-ral  results.  The  convexity  of  G  in  the  tesk 
division  iaplies  that  the  rate  of  activity  of  the 
syatea  will  be  greater  ia  the  dual-task  case  than  in 
at  least  on*  of  the  single  tasks.  If  the  workload  of 
the  two  tasks  is  very  disparate,  then  the  opportunity 
to  switch  between  a  very  activity-intensive  (high 
workload)  task  end  a  very  easy  one  aay  actually  reduce 
the  workload  from  what  it  it  in  the  case  that  only  the 
difficult  task  it  being  performed.  When  the  activity 
levels  for  the  two  single-task  cases  are  coaparable, 
though,  as  in  the  preceding  exaaple.  the  workload  for 
the  dual-task  case  is  greater  than  that  for  either  of 
the  single-task  cases.  This  increase  in  workload 
arises  froa  three  basic  sources,  which  aay  be  seen  by 
an  exanination  of  Eq.  (3.15).  First,  in  the  dual-task 
case,  the  variable  x  and  in  nost  cases  alto  the 
variable  x  will  have  a  larger  nncertainty  associated 
with  thea  because  of  their  larger  alphabets  Second, 
the  dual-task  prob!  :i»  requires  that  the  syatea  have 
tone  aeans  of  switching  between  the  sets  of  data 
stored  ia  permanent  memory:  the  variables  i,  and  a, 
provide  the  aechanisa  but  also  increase  the 
uncertainty  of  the  system.  Third,  the  rest  of  the 
internal  variables  aay,  because  of  access  to  different 
values  stored  in  aemory,  take  on  a  wider  range  of 
values  when  both  tasks  aust  be  perforned  than  when 
only  one  is  performed.  If  the  systea  performing 
either  task  alone  is  operating  near  its  aaxiaum 
allowable  rate,  then  requiring  the  systea  to  switch 
between  the  two  tssks  hat  the  effect  of  both 
eliminating  the  more  active  decision  strategies  froa 
the  feasible  set,  and,  in  the  case  that  the  nore 
active  strategies  also  result  in  better  perforaanee, 
lowering  the  perforaanee  of  the  systea. 


In  order  to  obtain  aor*  realistic  aodels  of 
huaana  carrying  out  information  processing  and 
decialonaaking  tasks,  it  is  necessary  that  aeaory, 
whether  internal  to  the  decision  process,  or  external 
in  the  fora  of  data  base*,  be  aodeled.  Three  classes 
of  aodels  are  described:  buffer  storape.  permanent 
aeaory  and  teaporary  aeaory.  The  modeling  of 
persianent  aeaory  has  been  prsented  and  illustrated 
through  its  use  to  the  analysis  of  the  performance- 
workload  characteristic  of  a  human  decisionmaker 
executing  a  dual  task. 

In  order  to  test  experiaentally  these  predictions 
the  model  for  the  dual-task  problem  as  defined  here, 
several  criteria  aust  be  act.  First,  the  two  tasks 
aust  be  similar  enough  that  the  same  set  of  algorithms 
aay  be  used  for  both  tasks:  however,  they  should  be 
independent  enough  so  that  execution  of  one  task  does 
not  aid  in  the  execution  of  the  other.  Second,  it 
should  be  necessary  to  switch  between  tasks,  i.e.,  two 
different  tasks  aay  not  be  performed  simultaneously. 
Third,  Individual  tasks  should  arrive  at  the  sane  rate 
in  the  dual-task  test  at  ia  the  tingle-task  test. 
Finally,  this  rate  of  presentation  should  be  near  to 
the  bounded  rationality  constraint  of  the 
decisionmaker,  since  it  is  hypothesised  that  it  is 
this  constraint  that  leads  to  perforaanee 
degradation. 
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ABSTRACT 


In  order  to  describe  the  overall  behavior  of  a  man- 
machine  system  it  has  long  been  recognized  that  there 
'  exists  a  need  to  determine  the  level  of  "workload" 
imposed  on  the  human  operator  in  achieving  design  objec¬ 
tives.  One  of  the  objectives  of  workload  related 
research  is  to  determine  areas  of  high  workload,  a  sub¬ 
set  of  which  entails  the  domain  of  information  over¬ 
load.  Several  mechanisms  of  adjustment  to  increasing 
information  processing  demands  are  available  to  the 
decision  maker;  e.g.  queueing,  filtering,  omission,  and 
employing  multiple  channels.  These  mechanisms  of 
adjustment  have  an  effect  on  the  human's  operative 
behavior  and  subsequent  strategy.  When  presented  with 
increasing  Information  processing  demands  near  or  in 
excess  of  his  capacity  the  decisionmaker  exhibits 
changes  in  operating  methods  to  avoid  crossing  an  over¬ 
load  threshold.  Hence  the  operative  methods  of  the 
decisionmaker  are  efficient  from  the  point  of  view  of 
performance,  while  being  economical  frotq  the  point  of 
view  of  workload. 

The  Dynamic  Decision  Model  (DDM)^is  a  normative- 
descriptive  model  that  has  shown  to  provide  an  excellent 
representation  of  human  information  processing  and 
decisionmaking  in  a  dynamic  multi-task  eavlranaent ,  In 
the  present  effort, '•li  sensitivity  study  was  performed 
on  the  DDM  in  an  attempt  to  explore  the  nature  by  which 
various  dynamic  task  attributes  affect  human  perfor¬ 
mance.  In  particular,  changes  in  parameters  such  as  the 
number  of  concurrent  tasks,  task  velocity,  task  proce¬ 
ssing  time  and  task  value,  were  Investigated  in  con¬ 
junction  with  existing  notions  concerning  human  operator 
workload.  This  global  approach  not  only  emphasizes  the 
performance  level  the  human  operator  may  attain  but  also 
the  tactics  and  strategies  he  uses  to  achieve  it.(”  —  — 

Results  Indicate  a  general  agreement  with  existing 
workload/performance  theories  and  some  inherent  human 
information  processing  limits  are  Identified.  Extensive 
experimental  studies  were  performed  along  these  lines 
by  using  an  experimental  paradigm  which  abstracts  some 
feature  of  a  C-*  decisionmaking  situation.  They 
quantitatively  confirm,  to  a  large  extent,  the  DDM 
analytical  predictions  of  human  performance  sensitivi¬ 
ties.  In  particular,  the  data  show  an  Interesting 
feature  of  human  behavior:  the  ability  to  adapt  to  high 
workload  situations  by  discriminating  some  alternatives 
and  maintaining  performance  through  some  "satisfaction" 
criterion.  A  subjective  workload  rating  technique 
(SWAT)  was  used  to  confirm  the  perceived  increase  in 
information  processing  demands. 

/  I  INTRODUCTION 

1.1  Background  and  Motivation  of  the  research 

Previous  research  at  the  University  of  Connecticut 
CYBERLAB  wfs  aimed  at  modeling  single  human  decision¬ 
making  processes  in  multiple  task  environments.  The 
experimental  part  of  this  effort  focused  on  developing 
a  canonical  decision  paradigm  through  which  the  deci- 
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sionmaker's  performance  and  behavior  were  studied  as 
task  parameters  were  changed  11-3] .  This  paradigm  was 
motivated,  in  large  part,  by  the  problem  of  target 
selection.  In  this  situation,  targets  of  various  type 
move  across  the  display  scopes  of  the  human  operator, 
vying  for  his  attention.  Each  target  has  a  different 
threat  value  and  processing  time  requirement.  The 
human,  therefore,  is  faced  with  the  problem  of  se¬ 
quencing  tasks  dynamically  so  as  to  maximize  the  per¬ 
formance  of  the  system. 

Fig.  1  shows  the  fundamental  decision  loop  that  is 
considered  in  our  approach.  The  human  information  and 
decision  process  Involves  1)  whether  to  process  a  task 
or  gather  more  information  (i.e.  monitor);  and  2)  which 
one  of  N  tasks  to  act  upon  (N  in  time  varying),  in 
order  to  maximize  the  system  performance.  The  decision 
loop  is  dynamic  in  nature.  As  time  evolves,  tasks  of 


Fig.  1.  MULTI-TASK  DYNAMIC  MONITORING/DECISION  LOOP 

different  value,  duration  (processing  time)  and  oppor¬ 
tunity  window  demand  the  operator's  attention,  while 
others  depart.  The  opportunity  windows  shrink  with 
time  as  the  tasks  approach  theln  deadlines. 

In  a  supervisory,  decisionmaking  situation  the 
human  operator  must  process  information  presented  en 
route  to  choosing  an  .ppropriate (course  of  action. 

With  regard  to  the  multi-task  monitoring  loop,  several 
sources  of  uncertainty  must  be  deWlt  with.  These 
include  human  produced  distortions  in  observing  vari¬ 
ables  presented  on  a  display,  uncertainty  in  deter¬ 
mining  the  status  or  state  of  the  Isystem,  various 
hypotheses  as  to  possible  courses  of  action,  and  the 
difficulty  in  envisioning  consequences  of  actions  sub¬ 
ject  to  the  overall  task  objective).  Clearly  ail  of 
these  sources  of  uncerta'nty  are  influential  in  deter¬ 
mining  the  amount  of  workload  imposed  on  the  human. 

In  the  present  paper,  a  joint  analytical  -  experi¬ 
mental  sensitivity  study  was  performed  to  investigate 
how  changes  in  the  information  processing  and  decision- 
r'king  load  affects  the  decisionmaker's  performance 
at.i  operative  behavior.  The  objectives  of  this  effort 
we  re  multifold: 


i 1)  To  analyze  Che  effect  of  variations  In  the  workload 
Imposed  on  the  operator  in  regard  to  his  performance 
and  operational  behavior. 

(2)  To  identify  and  understand  limits  on  human  informa¬ 
tion  processing  and  action  selection,  especially  in 
the  case  of  very  high  workload,  and  attendant 
decreasing  performance . 

(3)  To  try  to  find  appropriate  measures  of  a  priori 
objective  task  difficulty  and  to  compare  them  to 
previously  known  measures  [4,7]. 

(4)  To  validate  the  Dynamic  Decision  Model  (DDM)  pre¬ 
dictions  of  human  performance  across  a  wider  range 
of  task  parameters.  Specifically,  to  do  a  model- 
data  comparison  while  varying  experimental  param¬ 
eters  such  as  the  number  of  task  channels,  the 
velocity  of  the  tasks,  the  task  values,  and  the 
processing  time  range. 

(5)  To  compare  experimental  results  with  subjective 
measures  of  workload.  The  technique  used  here  is 
the  SWAT  (Subjective  Workload  Assessment  Technique) 
t6J. 

The  paper  is  organized  as  follows;  in  the  remainder 
of  this  section  a  brief  description  of  the  Dynamic 
Decision  Model  (DDM)  and  the  experimental  paradigm  used 
to  validate  it  will  be  given.  Part  II  will  describe 
the  simulations  and  experiments  done  for  the  sensitivity 
studies  after  which  Part  III  will  describe  some  of  the 
main  results  in  teitas  of  model  predictions  and  model 
data  comparisons,  jin  Part  IV  a  study  of  a  proposed 
apricrl  measure  of  task  difficulty  in  terms  of  the  ratio 
of  average  time  required/time  available  will  be  pre¬ 
sented.  Part  V  discusses  a  comparison  between  experi¬ 
mental  results  and  subjective  workload  prediction  using 
SWAT.  Comments  on  the  implication  of  the  results 
obtained  on  workload,  human  strategy  changes  and  the 
DDM's  predictions,  as  well  as  considerations  leading  to 
future  -esearch  directions,  will  conclude  the  paper. 

1.2  Review  of  the  Dynamic  Decision  Model 

Our  main  analytical  tool  will  be  the  Dynamic 
Decision  Model  (DDM)  previously  developed  by  Patipatti, 
Klelnman  and  Ephrath  [1-3].  This  normative-descriptive 
model  contains  several  Interesting  features.  First, 
the  analytic  framework  of  DDM  is  based  on  optimal  con¬ 
trol,  estimation,  and  semi-Markov  decision  process 
theories.  Thus,  this  approach  provides  a  general 
methodology  for  analyzing  dynamic  decisionmaking  under 
uncertainty.  Second  the  model  Introduces  the  important 
concepts  called  the  "task"  state  and  "decision"  state. 
The  task  state  is  the  detailed  description  of  the  inter¬ 
nal  variables  associated  with  each  task  (position, 
velocity,  type,  etc.).  The  decision  state,  which  is  a 
functional  transformation  of  the  task  state  is  the  mini¬ 
mum  number  of  variables  that  provides  different  infor¬ 
mation  for  making  decisions  (time  available,  time 
required  to  complete  each  task) .  Third,  the  DDM 
employs  the  widely  -  validated  Kalman  filter  theory  in 
an  information  processing  submodel  to  provide  the  con¬ 
ditional  mean  and  covariance  of  the  decision  state. 
Fourth,  and  perhaps  most  Important,  the  DDM  explicitly 
incorporates  human  limitations  in  the  information  pro¬ 
cessing  as  well  as  in  the  decision  stages.  These  in¬ 
clude  the  reaction  time  delays,  randomness  (i.e. 
observation  noise),  liu.it.-d  corah  iiiatuLial  capabilities 
and  randomness  in  decisions. 

A  block  diagram  of  the  DDM  is  shown  in  Fig.  2. 

Each  of  the  N  tasks  in  the  opportunity  window  is  re¬ 
presented  by  a  dynamic  system  acted  on  by  disturbances 
to  account  for  the  nonstationaritles  in  task  character¬ 
istics.  The  human' b  perceived  outputs  are  de¬ 

layed,  noisy  versions  of  the  task  states  \xTi }  and  are 
contingent  upon  the  monitoring  process.  The  perceived 
outputs  are  processed  to  produce  the  best  linear  un¬ 
biased  estimates  of  the  task  states  txpj}  and  their 
associated  covariances  (Ej)  via  a  Kalman  filter-pre¬ 
dictor  submodel  previously  used  in  the  Optimal  Control 
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Fig.  2 

Model  (OCM)  methodology.  The  statistics  of  the  task 
states  {xj^,E^}  are,  in  turn,  used  to  determined  the 
first  and  second  order  statistics  of  the  decision  state 
{Tjji.Oj^i}  time  required  and  time  available. 

The  statistics  of  the  decision  states  along  with  the 
task  values,  r^(t),  are  combined  to  determine  the 
attractiveness  measure,  M^(t),  of  each  task  in  the 
opportunity  window.  Subsequently,  the  measures  are 
used  to  generate  the  probability  Pdi(t)  of  acting  on 
each  of  the  N  tasks  and  the  probability  Pdo(t)  of  not 
acting  on  any  task  (or  the  monitoring  probability, 
Fdm^))- 

The  DDM  is  capable  of  predicting  various  perfor¬ 
mance  measures,  such  as:  the  total  reward  earned,  the 
percentage  of  total  possible  reward  earned,  the  number 
of  tasks  processed,  the  total  amount  of  time  spent 
acting  on  tasks,  etc.  [1-3]. 

1-3  The  Experimental  Paradigm 

A  simple,  yet  realistic,  computer  controlled  ex¬ 
perimental  set-up  was  considered  as  indicated  in  Fig. 3. 

In  the  experiments,  the  subjects  observe  a  CR~ 
screen  on  which  multiple,  concomitant  tasks  are  repre¬ 
sented  by  moving  rectangular  bars.  The  bars  appear  at 
the  left  edge  of  the  screen  and  move  at  different  ve¬ 
locities  to  the  right,  disappearing  upon  reaching  the 
right  edge.  Thus,  the  screen  width  represents  an 
"opportunity  window". 

The  height  of  each  bar  corresponds  to  the  reward 
(value)  of  the  task.  The  amount  of  time  required  to 
process  a  tank  (in  seconds)  is  represented  by  the 
number  o*  dots  displayed  on  a  bar.  A  task  is  processed 
by  the  subject  when  he  pushes  the  appropriate  push¬ 
button  as  shown  in  Fig.  3.  By  processing  a  task  suc¬ 
cessfully,  the  subject  is  credited  with  the  correspond¬ 
ing  reward  (r^),  and  the  completed  task  is  eliminated 
from  the  screen.  An  attempt  by  the  decisionmaker  to 
act  on  a  task  that  cannot  possibly  be  completed  (i.e. 
the  time  required  is  greater  than  the  time  available) 
constitutes  an  error.  No  partial  credit  is  given. 
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In  the  present  experimental  paradigm,  from  which 
the  DDM  was  developed,  several  factors  Influence  the 
decisionmaker's  workload.  The  humon  must  process  infor¬ 
mation  involving  the  amount  of  time  required  (processing 
time  dots),  the  value  of  a  particular  task  (height),  and 
the  position  and  velocity  of  the  task  relative  to  the 
tasks 's  opportunity  window  deadline.  All  of  these  fac¬ 
tors  must  be  considered  for  each  of  the  N  task  lines 
(channels)  that  are  simultaneously  vying  for  the  deci¬ 
sionmaker's  attention.  By  processing  this  Information 
(i.e.  reducing  his  uncertainty  about  the  variables  pre¬ 
sented),  the  decisionmaker  then  uses  his  knowledge  of 
the  "state"  of  the  system  to  develop  an  appropriate 
choice  of  which  task  to  act  upon. 

II  METHODOLOGY  FOR  THE  SENSITIVITY  STUDY 
II. 1  Model  Simulation 

On  the  basis  of  existing  notions  concerning  the 
effect  of  various  task  parameters  on  the  human  opera¬ 
tor's  performance  and  subsequent  workload  (4,7)  four 
task  attributes  were  considered.  They  include: 

A)  different  task  velocities, 

B)  different  task  values, 

C)  different  task  processing  times, 

D)  varying  the  number  of  task  lines  (channels)  to 

be  monitored. 

As  indicated  in  Fig.  3,  the  task  velocity  is  in¬ 
versely  related  to  the  time  available  in  which  to  suc¬ 
cessfully  engage  and  process  a  task.  More  specifically, 
the  time  available  to  process  task  1  at  time  t  is  given 
by 


Tai(t) 


L-^(t) 

vi(t) 


where 


L  is  the  length  of  the  opportunity  window  (1024 
screen  units)  -  1  inch  «  85  s.u. 
t  is  the  position  of  the  task  from  the  left  edge 
(in  screen  units) 

is  the  velocity  of  the  task  (screen  units/ 
second) 

It  was  postulated  that  by  increasing  the  average 
task  velocity  a  decrease  in  performance  (as  measured  by 
the  number  of  tasks  completed  and  the  total  reward 
earned)  would  be  obtained.  A  similar  relationship  be¬ 
tween  performance  and  the  average  processing  time  of 
the  tasks  was  expected,  since  as  more  time  was  required 
to  process  a  task  fewer  tasks  could  be  completed  prior 
to  leaving  the  opportunity  window. 

By  increasing  the  number  of  distinct  task  values 
or  the  number  of  Cask  channels  (lines)  which  must  be 
monitored  simultaneously,  a  decrease  in  performance  was 
expected  due  to  the  increased  demands  placed  on  the  in¬ 
formation  processing  portion  of  the  model  in  conjunc¬ 
tion  with  increasing  uncertainty  (i.e.  conflicts)  among 
decision  alternatives. 

II. 2  The  Experimental  Program 

To  parallel  most  of  the  sensitivity  studies  per¬ 
formed  on  the  DDM  by  model  simulation,  the  experimental 
paradigm  described  in  1.3  was  used.  For  each  experi¬ 
mental  setting,  3  different  subjects,  all  graduate 
students  in  the  Department  of  Electrical  Engineering 
and  Computer  Science  at  the  University  of  Connecticut, 
were  tested  on  2  different  experiments  (random  arrivals) 
yielding  6  values  for  each  average  data  point  obtained. 
The  duration  of  each  experiment  was  90  sec  and  the  sub¬ 
ject  had  immediate  visual  feedback  on  his  overall  per¬ 
formance.  The  subjects  were  trained  on  various  experi¬ 
mental  conditions;  however  data  was  taken  only  after  a 
certain  level  of  subjective  confidence  was  felt  by  the 
subjects. 

At  the  end  of  each  sequence  of  experiments,  a  short 


interview  was  designed  to  try  to  determine  the  various 
subjective  strategies  used.  Overall,  it  was  interest¬ 
ing  to  remark  that  although  each  of  the  subjects  used 
different  strategies  to  maximize  their  rewards,  the 
intra  and  inter  subject  differences  in  performance  were 
minor.  Therefore,  the  experimental  results  presented 
in  the  next  part  will  describe  only  the  mean  performance 
and  not  the  inter  subject  variations. 

Ill  RESULTS  AND  MODEL  -  DATA  COMPARISONS 

Only  the  most  significant  results  will  be  presented 
here.  A  complete  presentation  of  the  DDM  sensitivity 
study  can  be  found  in  [5]. 

III. I  The  Sensitivity  to  Task  Velocity 

With  regard  to  task  velocity  three  separate  DIM 
computer  simulations  were  performed: 

1)  all  tasks  had  the  sams  velocity  which  was  varied 
from  25  screen  units  (su)  per  second  to  200  su/sec. 

2)  all  tasks  had  the  same  mean  velocity  with  uniform 
distribution  range  of  +25  su/sec,  the  mean  varied 
from  25  su/sec.  to  175  su/sec. 

3)  all  tasks  had  the  same  mean  velocity  of  100  su/sec 
with  uniform  distribution  ranges  of  0,  +25,  +50, 

+75,  +100.  . . 

Results  from  part  (1)  are  in  general  agreement 
with  our  hypothesis  concerning  the  relationship  between 
velocity  and  the  time  available  to  process  a  task.  As 
expected  a  sharp  decrease  in  the  percentage  reward 
earned  was  observed  as  the  velocity  of  the  tasks  was 
increased.  It  does  appear,  however,  that  there  is  a 
tendency  for  the  model  to  reach  a  "saturation  point"  as 
indicated  by  the  number  of  tasks  (20+2)  completed  and 
in  the  total  expected  reward  earned  during  the  simula¬ 
tion  duration  (90  sec)  . 

In  part  (2)  of  the  velocity  study,  the  mean  veloc¬ 
ity  was  increased  in  a  manner  similar  to  part  (1)  but 
with  some  uncertainty  in  this  value.  Results  were 
expected  to  show  a  general  agreement  with  those  obtained 
in  part  (1),  which  they  did:  The  DDM  performance  was 
affected  very  little  by  the  small  uncertainty  (+25Z)  on 
the  task  velocity. 

Finally,  part  (3)  attempted  to  investigate  the 
effect  of  various  amounts  of  uncertainty  on  the  velocity 
of  each  task  as  perceived  by  the  information  processing 
portion  of  the  model.  A  mean  velocity  of  100  su/sec 
was  chosen  for  all  the  tasks  with  the  interval  over 
which  the  velocity  was  qpiformely  distributed  (e.g.  the 
velocity  standard  deviation)  being  increased.  Results 
indicated  that  the  model  was  able  to  successfully  over¬ 
come  the  perceived  uncertainty  in  task  velocity  and 
develop  a  strategy  to  improve  performance,  although 
this  may  have  been  a  by-product  of  having  more  than 
enough  time  in  which  to  process  the  tasks. 

The  experiments  done  for  the  sensitivity  study  on 
task  velocity  paralleled  the  simulation  study  (2)  where 
all  the  tasks  have  the  same  mean  velocity  (with  a  small 
uncertainty  of  +25  su/sec)  which  was  varied  from  "slow" 
to  "fast".  Here  again  the  general  hypothesis  was  that 
the  performance  of  the  subjects  (in  terms  of  percentage 
of  reward  earned)  will  degrade  as  the  velocity  of  the 
tasks  was  increased,  due  to  the  corresponding  decrease 
in  the  time  available  to  process  a  task  (i.e.  the 
faster  the  task,  the  sooner  it  reaches  the  end  of  the 
opportunity  window).  Fig.  4  presents  model/data  com¬ 
parison  in  terms  of  thp  percentage  of  reward  earned 
and  the  total  rewar.  earned.  Model  predictions  and 
experimental  results  were  in  excellent  agreement  up  to 
the  "nominal"  situation  of  a  mean  velocity  of  75  su/sec 
(a  task  of  moderate  velocity).  Beyond  that  point,  the 
human  subject  performed  better  than  the  model,  re¬ 
flecting  a  process  of  adjustment  by  the  human  to  the 
increasing  workload  demands  (an  ability  the  model 
doesn't  have  at  present). 
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Fig.  4  SENSITIVITY  TO  VELOCITY 

In  summarizing  Che  velocity  study,  it  seems  appar¬ 
ent  that  the  most  Influential  aspect  of  velocity  on 
performance  manifests  itself  in  the  resulting  time 
available /time  required  trade-off  that  must  be  evaluated 
in  cask  selection.  Clearly  the  information  processing 
aspect  of  uncertainty  in  the  perceived  velocity  played 
a  lesser  role  in  determining  the  resulting  performance. 
This  result  is  in  agreement  with  existing  notions  re¬ 
garding  available  time  to  process  a  task  and  the  per¬ 
formance  achieved,  and  forms  a  central  theme  of  the 
workload  theories  which  subscribe  to  the  notion  that  the 
rate  at  which  tasks  must  be  processed  is  indicative  of 
the  workload  imposed  on  the  operator.  In  high  workload 
situati-ms,  the  divergence  in  performance  between  model 
predictions  and  human  subjects  seems  to  signal  the  fact 
that  the  human  develops  a  strategy  that  adapts  to  high 
workload  demands  (high  task  speed,  short  inter-arrival 
times) . 

III. 2  The  Sensitivity  to  the  Range  of  Task  Values 

In  this  study  it  was  hypothesized  that  by  in¬ 
creasing  the  number  of  possible  reward  values  of  the 
tasks  a  resulting  increase  in  the  uncertainty  of  selec¬ 
tion  due  to  conflicts  would  appear.  Results  of  this 
part  of  the  model  simulation  study  showed  consistent 
performance,  as  measured  by  the  total  percentage  tasks 
completed,  across  the  range  of  task  values.  It  seems 
apparent  that  the  model  was  able  to  successfully  dis¬ 
criminate  tasks  of  low  value  from  those  of  high  value 
and  adopt  a  strategy  reflecting  this  segregation. 

As  a  way  to  verify  the  assumption  that  the  model 
was  able  to  discriminate  high  valued  tasks  from  low 
valued  tasks,  a  comparison  of  the  actual  reward  earned 
and  the  mean  value  of  the  tasks  multiplied  by  the  number 
of  tasks  earned  was  performed  (see  Fig.  5).  Clearly, 
here  we  see  that  the  model  was  able  to  segregate  the 
more  desirable  (high  valued)  tasks  to  be  processed. 

With  regard  to  overall  performance,  a  no rmallzed 
reward  (actual  reward  earned/average  task  value)  was 
computed  and  plotted  in  (Fig.  6).  Here  again  the  con¬ 
sistent  performance  achieved  by  the  model  is  exhibited. 

Summarizing  the  study  on  the  relationship  between 
the  performance  achieved  and  the  number  of  distinct 
task  values.  It  appears  that  the  model  is  able  to  sue- 
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Fig.  5  SENSITIVITY  TO  THE  RANGE  OF  TASK  VALUES 

cessfully  segregate  more  desirable  tasks  to  be  processed 
from  lesser  ones.  By  doing  so,  consistent  performance 
is  achieved.  Hence  it  appears  that  the  range  of  possi¬ 
ble  values  produces  a  strategy  which  Is  able  to  com¬ 
pensate  for  any  conflicts  in  choice  such  that  a  "satura¬ 
tion  level"  in  performance  is  obtained. 
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Fig.  6  NORMALIZED  REWARD 

It  appears  that  a  heuristic  measure  of  an  a 
priori  absolute  attractlvity  of  a  task  1  is  a  ratio  y^ 

m  _ Value  of  task  1 _ 

'i  Time  required  to  process  task  i 

which  induces  a  multi-level  threshold  strategy: 

1.  for  Yj>y'  "Always  try  to  process  task  i" 

2.  for  Yt<Y"  "Never  try  to  .process  task  i" 

3.  for  Y'jLTj^Y’  "Depends  on  concurrent  tasks" 

For  example:  for  the  simulation  with  10  different  task 
values  and  5  different  processing  times  the  values  y'~ 
6.2  and  y"“^.3  have  been  found. 

Note :  No  experiment  was  performed  in  the  study  of  sen¬ 
sitivity  to  the  number  of  task  values. 

III. 3  Sensitivity  to  the  Range  of  Task  Processing  Time 

In  this  study,  the  range  of  possible  seconds 
needed  to  process  a  task  was  varied  from  ail  tasks 
having  the  same  processing  time  of  1  second  to  that  of 
the  tasks  having  anywhere  from  1  to  8  seconds  processing 
time.  The  velocity  of  the  tasks  was  chosen  to  be  a 
nominal  value  (100  screen  units/sec)  corresponding  to  a 
moderately  paced  task  with  a  small  standard  deviation 
about  this  value.  Here  again  performance  was  expected 
to  decrease  as  a  result  of  the  time  available/time  re¬ 
quired  trade-off  that  affects  the  decisionmaker's 
ability  to  process  tasks  prior  to  their  deadlines.  In¬ 
deed  both  model  and  experimental  results  confirm  this 
hypothesis  (Fig.  7). 

As  can  be  seen  from  the  plots,  the  model  and  human 
are  In  good  agreement  up  until  the  nominal  case  of 
tasks  which  hod  a  processing  time  range  of  1-3  seconds. 
After  this  nominal  situation  the  human  processed  tasks 
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Fig.  7  SENSITIVITY  TO  PROCESSING  TIME  RANGE 

with  processing  times  which  were  below  the  average  value 
of  the  tasks  presented  to  him.  This  change  in  the  type 
of  tasks  processed  by  the  human  reflects  a  change  in 
strategy  that  accounts  for  the  better  performance 
achieved  as  compared  with  the  model.  In  other  words, 
the  human  adapts  a  more  efficient  behavior  as  the  in¬ 
formation  load  (range  of  possible  processing  times)  in¬ 
creases.  He  discriminates  among  the  tasks  that  are 
favorable  (small  processing  time)  from  those  deemed  un¬ 
favorable  (large  processing  time  hence  smaller  time 
available  to  process  the  task  prior  to  its  deadline) . 

III.  4  Sensitivity  to  the  Number  of  Task  Channels 

In  this  study,  it  was  hypothesized  that  by  in¬ 
creasing  Che  number  of  task  lines  to  be  monitored  sim¬ 
ultaneously  a  decrease  in  performance  would  result  due 
to  increased  demands  on  the  information  processing  and 
action  selection  capabilities  of  the  human.  Fig.  8 
shows  the  performance  achieved  by  both  the  model  and 
the  human.  As  expected  the  percentage  of  reward 
earned  decreases  as  more  task  lines  are  presented  to 
the  decisionmaker.  The  human  again  outperforms  the 
model  as  the  number  of  task  lines  increased  beyond  the 
nominal  condition  of  five  lines.  This  appears  to  be 
due  to  the  fact  that  the  human  adjusts  his  behavior  in 
an  attempt  to  keep  his  performance  constant  when  the 
number  of  tasks  to  be  monitored  simultaneously  ap¬ 
proached  his  short  term  memory  capacity  (5-7  lines). 

The  plot  of  the  number  of  tasks  completed  by  both 
the  model  and  the  human  as  a  function  of  the  number  of 
task  lines  shows  the  so-called  inverCed-U  shape  hypoth¬ 
esized  to  exist  under  conditions  of  high  workload.  The 
experimental  data,  however,  indicates  a  leveling  off 
in  the  number  of  tasks  completed  by  the  human;  achieved 
by  adjusting  his  behavioi  to  account  for  the  increase 
in  the  number  of  task  lines,  (i.e.  maintaining  a  sub¬ 
jective  level  of  performance). 

Also  Indicated  in  Fig.  8  is  that  the  human  is  out¬ 
performing  the  model  in  terms  of  the  probability  of 
conmltting  an  error  by  the  model  is  incieasiug  «s  a 
function  of  the  number  of  lines  monitored,  whereas  the 
human  maintains  a  relatively  constant  low  error  level. 
This  fact  can  explain  the  constant  human  performance 
phenomenon  exhibited  earlier,  in  the  sense  that  the 


human,  by  trying  to  process  less  information  as  the 
workload  Increases,  is  able  to  commit  fewer  errors  and 
therefore  to  achieve  a  higher  performance  than  the 
model  prediction. 
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Fig.  8  SENSITIVITY  TO  THE  NUMBER  OF  CHANNELS 
IV  AN  A  PRIORI  MEASURE  OF  TASK  DIFFICULTY:  THE  p  RATIO 

As  a  result  of  the  sensitivity  program  described 
before,  in  particular  the  study  of  sensitivities  to 
velocity  and  processing  time  range,  it  was  hypothetized 
that  the  ratio  p 


T 

_R  m  Average  time  required 
P  =  Average  time  available 

TA 

is  a  suitable  measure  of  a  priori  task  difficulty  ex¬ 
hibited  by  a  specific  dynamic  task  in  the  current  ex¬ 
perimental  paradigm.  It  can  be  easily  shown  that  the 
p  factor  is  given  by  the  following  formula: 


P  -  •  a(l-B2) 

rn  is  the  maximum  number  of  unit  time  (dots) 
per  task 


where 


is  normalized  velocity 


V  is  the  average  nominal 
velocity 
_  L 


nx<5 


is  a  screen 
window  limita¬ 
tion 


L:  length  of  the 
opportunity 
window 


AV 


B  "  —  represents  the  un- 
V  V  certainty  on  the  velocity 
(uniform  distribution) 


time  value  of  a 
unit  time  (in 
seconds) 


Fig.  9  depicts  a  map  of  theoretical  curves  of 
equal  p  for  different  combinations  of  a  and  S.  The 
hypothesis  that  the  average  factor  0‘f^/T^  correlates 
with  workload  and  affects  degradation  of  performance  Is 
verified  by  model  simulation  as  well  as  by  experimental 
results  also  shown  in  Fig.  9  for  the  case  n«5,  a-1/2 
(nominal).  It  can  be  seen  that  an  increase  in  c  leads 
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a  decrease  In  performance. 


.<  .6  .»  1.0 


\ln  conclusion  rhe  experimental  results  allow  us  to 
assume  that  the  p  factor  can  represent  an  appropriate 
aprlorV  measure  of  task  difficulty  in  the  current  dyna¬ 
mic  environment . 

v  Objective  workload  and  effectiveness 

In  an  effort  to  compare  the  level  of  workload 
Imposed  upon  Nje  human  with  that  perceived  by  the  human, 
rankings  of  subjective  workload  were  calibrated  with 
subject's  rank  ordering  of  workload  along  the  three 
dimensions  of  Time  Load,  Mental  Effort  Load  and  Stress 
Load.  The  methodology  used  was  the  Subjective  Workload 
Assessment  Technique  (SWAT)  well  documented  in  [6].  The 
experimental  setting  used  was  the  paradigm  presented  in 
part  I  and  the  method  of  controlling  imposed  workload 
was  by  increasing  the  number  of  task  channels  from  2  to 
10,  all  other  parameters  (velocities,  task  values  and 
processing  time  ranges)  remaining  fixed.  As  expected, 
the  average  workload  perceived  by  the  subjects  increased 
linearly  with  the  number  of  task  lines  to  be  monitored 
(Fig.  11). 
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Fig.  9  (a)  p  curves,  (b)  performance  vs.  p 

A  sensitivity  study  was  performed  on  the  model  and 
Iso  experimentally  to  check  the  assumption  that  for 
onstant  p  the  performance  of  the  decisionmaker  remains 
■onstant,  although  the  range  of  dynamics  parameters 
;uch  as  velocity  and  dot  time  value  varied  In  a  wide 
range.  Fig.  10  confirms  this  hypothesis:  the  experimen- 
;al  daca  shows  that  the  human  develops  a  strategy  to  . 
lalntain  his  performance  relatively  constant  when  the 
iverage  a  priori  workload  (p  factor)  is  kept  constant, 
tote  that  when  the  value  of  the  dots  is  <  .25  sec,  the 
serformance  degrades,  probably  due  to  the  fact  that  we 
»re  very  close  to  human  time  delay  limitations.  The 
•odel,  however,  does  not  achieve  a  constant  performance 
level,  but  rather  exhibits  a  degradation  in  performance, 
this  degradation  yields  an  Increase  in  the  average  error 
probability  unlike  the  relatively  small  and  constant 
average  error  probability  observed  in  the  experimental 
data*  Perfomwnc*  for  constant  9 

1  (varying  Join Jy  dot  value  and  velocity  range) 
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Fig.  11  PERCEIVED  AND  IMPOSED  WORKLOAD 

Corresponding  to  this  increase  In  perceived  work¬ 
load,  a  change  in  human  strategy  under  high  workload 
conditions  was  noticed  in  conjunction  with  the  perfor¬ 
mance  level  attained  (recall  Fig.  8).  This  change  of 
strategy  is  depicted  in  Fig.  12,  where  an  effectiveness 
factor  e  given  by 

E  «  f  average  value  of  tasks  processed  ,"J  j 

J_average  value  of  tasks  presented  J  x 

is  plotted  as  a  function  of  subjective  workload  for  the 
same  experiment.  Essentially  this  effectiveness  factor 
indicates  that  the  human  adapts  a  more  discriminating 
strategy  (acting  on  tasks  of  average  task  value  higher 
than  the  average  of  those  presented  to  him)  when  he  is 
.confronted  with  increasing  information  processing  de¬ 
mands.  This  discriminating  behavior  accounts  for  the 
ability  of  the  human  to  outperform  the  DDM  as  discussed 
earlier  in  adapting  a  strategy  for  high  workload  condi¬ 
tions.  Fig.  12  shows  clearly  the  'jump'  of  operative 
behavior  in  terms  of  average  effectiveness  between  low 
and  high  workload  conditions. 

Fig.  12  Sffectlvene.t  Factor  v*.  Subjective  Wortload 
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VI  SUMMARY  AND  CONCLUSIONS 

In  this  joint  experimental-analytical  study,  per¬ 
formance  obtained  by  the  Dynamic  Decision  f.-’del  (DDM) 
and  human  subjects  in  the  experimental  paradigm  of  Fig. 

3  was  examined  as  a  function  of  four  task  attributes 
which  Included  different  task  velocities,  different 
task  values,  different  task  processing  times,  and 
varying  the  number  of  task  lines  to  be  monitored.  Re¬ 
sults  of  the  study  showed  a  general  agreement  with 
existing  hypotheses  regarding  human  performance  as  a 
function  of  the  four  attributes  mentioned.  The  primary 
factor  affecting  performance  was  that  of  task  velocity 
as  indicated  in  the  time  available/time  required  trade¬ 
off  of  traditional  operator  workload  research.  In  addi¬ 
tion,  it  was  noted  that  the  human  operator  was  able  to 
successfully  discriminate  various  undesirable  tasks 
from  desirable  ones  as  a  means  to  keeping  performance 
consistent.  This  may  be  due  to  a  somewhat  conservative 
threshold  -  strategy  employed  by  the  decisionmaker,  but 
in  general  it  is  felt  to  be  consistent  with  one's 
Intuition. 

The  results  of  the  model  simulation  study  also 
indicate  that  a  decrease  in  performance  can  be  obtained 
not  only  from  Increased  information  processing  demands, 
but  also  from  situations  in  which  action  selection 
uncertainty  (i.e.  conflicts)  exist.  This  was  most 
clearly  represented  in  the  study  Involving  the  Increased 
number  of  tack  lines  to  be  monitored  simultaneously.  In 
that  study  it  has  been  shown  that  beyond  a  certain 
number  (-7)  of  task  lines  (channels),  the  operator  per¬ 
formance  decreases  sharply  due  to  the  increasing  con¬ 
flict  in  action  selection  and  limited  information  pro¬ 
cessing  capability. 

However,  in  the  experiments,  the  subjects  showed 
different  performance  in  the  high  workload  condition. 
Instead  of  decreasing,  the  subject's  performance  was 
maintained  at  a  quasi- constant  level  due  to  adaptation 
In  operative  behavior  in  order  to  (1)  process  less  in¬ 
formation,  (2)  commit  fewer  errors,  and  (3)  maintain  a 
subjective  level  of  performance  across  high  workload 
environments.  In  diagram  from  the  mechanism  by  which 
decisions  are  made  by  the  human  appears  to  be  that  of 
the  following: 
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Fig.  13  Anatomy  of  a  decision 

Another  contribution  of  the  present  effort  is  the 
proposition  of  an  objective  measure  of  a  priori  task 
difficulty  in  terms  of  the  factor  p.  This  measure 
(0  *  ^as  keen  f°und  to  be  adequate  in  the 

existing  experimental  paradigm,  although  more  research 
will  have  to  be  done  to  improve  the  measure  in  different 
dynamic  environments.  Subjective  workload  was  found  to 
correlate  very  well  with  objective  workload  imposed  by 
an  experiment  and  an  interesting  find  was  that  humans 
tend  to  be  more  efficient  in  reasonably  high  workload 
conditions. 
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Introduction 

The  purpose  of  this  paper  is  to  demonstrate  the 
indamental  Importance  of  individual  differences  among 
llitary  personnel  in  their  information-seeking  and 
recessing  behavior  when  faced  with  critical  decisions 
a  a  realistic  task.  The  data  reported  ir  this  paper 
ere  collected  at  sea  in  the  mid  1960s.  The  analysis 
f  this  data  base  has  provided  an  object  lesson  showing 
he  importance  of  empirical  data  to  decisionmaking 
esearch. 


Determinants  of  Decisionmaking  Behavior 

In  general,  the  major  determinants  of  decislon- 
aklng  behavior  Include  such  factors  as  information 
resentatlon  rate,  perceived  problem  complexity,  per- 
elved  time  available  for  decisionmaking,  the  number 
nd  quality  of  perceived  alternatives,  and  the  per- 
elved  risks  (Sage  111,  Sage  and  White  (2),  Pattlpatl, 
t.  al  [ 3 1 ,  and  Wohl  |5|.*  In  this  paper,  however,  we 
hall  examine  two  other  factors  for  which  empirical 
ata  are  not  often  available.  These  are:  1)  indivld- 
al  differences  in  cognitive  and  decision  styles;  and 
)  level  of  expertise  attained. 

The  Operational  Problem  in  Active  Sonar 
Classification 

A  modern  active  sonar,  such  as  the  SQS-23  or  SQS- 
!6,  emits  an  extremely  strong  pulse  of  low  frequency 
lound  into  the  water  on  a  nearly  omni-directional  ba¬ 
ils.  The  outgoing  sound  pulse  is  diminished  in  inten- 
ilty  by  both  the  inverse  square  law  and  the  absorption 
»f  sound  in  water.  It  is  reflected  by  surface  wave 
fronts,  bathypelaglc  fish,  sea  bottom  anomalies, 

■chools  of  shrimp,  whales,  and  floating  debris  as  well 
is  real  submarines,  and  is  returned  to  the  receiving 
iqulpment  after  having  suffered  another  set  of  identi- 
:al  losses  on  the  return  trip.  In  the  receiver,  this 
ilgnal  information  is  mixed  with  noise  from  several 
lources  including  own  ship  and  other  ships  as  well  as 
ocean  sources. 

The  resulting  mixture  of  signals  and  noise  may  be 
presented  to  the  sonar  operator  in  several  display  for- 
sats:  a  PPI  display,  a  tactical  range  or  time  history 
recorder,  and  a  pair  of  audio  headphones.  The  sonar 
operator  must  examine  the  return  Information  from  each 
pulse  or  "ping”,  and  as  more  and  more  information  is 
accumulated  on  the  sonar  contact,  he  is  required  to 
make  and  report  a  decision  as  to  whether  the  received 
information  represents  l)  a  non-submarine  contact,  2)  a 
possible  submarine  contact,  or  1)  a  probable  submarine 
contact.  This  decision  is  made  in  conjunction  with  his 
watch  supervisor  and/or  the  antisubmarine  warfare 
officer  on  board  the  ship. 


•References  are  Indicated  by  number  In  square  brackets 
and  appear  at  the  end  of  this  paper. 


This  classification  decision  is  only  one  of  sev¬ 
eral  activities  which  the  sonar  operator  must  perform. 
These  activities  include  control  of  the  sonar  search 
process  itself,  monitoring  of  the  display  and  detection 
of  possible  signals  against  the  noise  background, 
tracking  of  these  signals  over  a  period  of  time  to 
determine  their  correlation  and  consistency,  and  ulti¬ 
mately  classification  Itself. 

The  fundamental  classification  decision  must  an¬ 
swer  the  question,  'is  this  contact  sufficiently  like  a 
submarine  to  be  classified  as  a  submarine?"  There  are 
five  basic  subquestions  involved:  1)  Is  the  contact 
truly  moving?  2)  What  kind  of  reflective  structure 
does  it  have?  3)  How  large  is  it?  A)  What  is  its 
shape  or  aspect?  5)  Does  It  have  depth  below  the  ocean 
surface?  In  addition,  strong  correlations  among  the 
answers  to  these  five  subquestions  lend  further  cre¬ 
dence  to  one  or  another  of  the  possible  hypotheses. 

The  Experiments 

A  U.S.  Navy  destroyer  spent  a  number  of  weeks  at 
sea  during  the  mid-1960's  collecting  a  representative 
sampling  of  submarine  and  nonsubmarine  contact  infor¬ 
mation  on  a  modern  active  scanning  sonar.  Some  sixty 
hours  of  contacts  were  recorded  of  which  approximately 
602  were  submarine  contacts  and  401  nonsub  contacts. 

The  data  were  captured  directly  off  of  the  sonar  pre¬ 
amplifiers  onto  54  reels  of  56-channel  tapes.  The  sub¬ 
marine  contacts  were  made  using  friendly  submarines 
under  strict  control  in  order  to  obtain  information  at 
various  ranges,  depths,  speeds  of  both  submarine  and 
own  ship,  and  submarine  aspect  angles,  as  well  as  under 
a  variety  and  range  of  environmental  and  sea  condi¬ 
tions.  The  nonsubmarine  contacts  Included  identified 
fish,  porpoises,  whales,  bottom  wrecks ,  and  one  float¬ 
ing  log.  The  information  was  collated  by  a  group  at 
the  Applied  Research  Laboratory  of  the  University  of 
Texas. 

From  this  collated  information  a  realistic,  100- 
item  test  was  constructed  consisting  of  fifty  subs  and 
50  nonsub  contacts  and  as  much  as  25  to  40  "pings'  per 
contact,  all  taken  under  a  representative  range  of  sea 
conditions  including  sea  state,  wind  velocity,  wind  di¬ 
rection,  layer  depth,  bottom  type,  and  bottom  depth. 

The  test  tapes  were  then  presented  under  highly  con¬ 
trolled  conditions  on  real  sonar  equipment  to  37 
Pacific  Fleet  sonar  operators.  These  subjects  repre¬ 
sented  a  wide  range  of  experience,  measured  in  terras  of 
years  of  Naval  service,  number  of  sonar  teaching  jobs, 
number  of  sonar  schools  attended,  number  of  ship 
assignments ,  total  years  of  sonar  experience  and  years 
of  experience  with  the  sonar  used  in  this  experiment. 

In  addition  to  the  regular  sonar  equipment  with 
its  displays  and  controls,  the  subjects  were  given  ten 
push  buttons  labeled  0  through  4  and  6  through  10  and 
instructed  to  yress  number  10  if  absolutely  certain  of 
of  a  submarine,  number  0  if  absolutely  certain  of  a 


nonsubmarlne ,  and  nuabers  1  through  9  as  appropriate 
on  each  pi* g.  Omitting  the  number  5  from  the  group  of 
push  buttons  essentially  produced  a  scaled  forced 
choice  situation.  Note  that  pushing  a  button  Is  es¬ 
sentially  the  same  as  a  subject  reporting  his  posterior 
probability  that  his  response  resulted  from  the 
presence  of  a  submarine.  Thus,  this  method  Is  a  mech¬ 
anization  of  Green  and  Suets'  [5]  subjective  confidence 
rating  method  for  deriving  Receiver  Operating  Charact¬ 
eristic  (ROC)  curves.  It  efrectlvely  compresses  multi¬ 
ple  unknown  dimensions  (l.e.,  the  various  sonar  cues 
and  the  weights  given  to  them  by  the  subjects)  into  a 
single  subjective  dimension.  Both  the  experiment  and 
the  results  are  described  In  Uohl  et  al  [6]  and  Nacht 
and  Nohl  [ 7 ) . 

Experimental  Results 

Using  this  method,  ROC  curves  were  developed  for 
each  of  the  37  sonar  operators.  It  was  quickly  ap¬ 
parent  that  the  operators  fell  Into  two  groups  which 
could  be  labeled  "average"  and  "expert".  The  single 
exception  to  this  categorization  was  the  one  subject 
who  exhibited  a  negative  ROC  curve.  Figure  1  shows  Che 
combined  results  for  the  "average"  group  of  33  oper¬ 
ators.  Note  that  the  performance  Is  little  better  than 
random,  and  Chat  accumulation  of  additional  Information 
had  no  effect  on  performance  for  this  group. 
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Figure  1.  Average  ROC  Curves  of  "Average"  Group 

P(s/S)  •  probability  that  sonar  operator  says  "sub" 
when  contact  is  a  sub  (“hit"  probability) 

P(s/NS)  ■  probability  that  sonar  operator  says  "sub" 
when  contact  la  a  non-sub  ("false  alarm" 
probability) 

By  way  of  contrast,  the  average  curves  of  the 
highest  performance  or  "expert"  group  of  3  operators  Is 
shown  In  figure  2.  Here  it  Is  clear  that  these  oper¬ 
ators  were  able  to  make  use  of  the  additional  infor¬ 
mation  In  Improving  their  discrimination  capability. 

In  order  to  determine  the  effect  of  individual 
differences  on  classification  performance,  the  data  for 
each  subject  were  aggregated  in  the  following  way. 
Pressing  any  of  the  buttons  0  through  4  was  takun  as  a 
nonsub  response,  while  pressing  any  of  the  buttons  6 
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Figure  2.  Average  ROC  Curves  of  Highest  Performance 
or  "Expert"  Group 

P(s/S)  ■  probability  that  sonar  operator  says  "sub” 
when  contact  is  a  sub  Chit"  probabilitv) 

P(s/NS)  »  probability  that  sonar  operator  says  "sub" 
when  contact  Is  a  non-sub  ("false  alarm* 
probability) 

through  10  was  taken  as  a  sub  response.  The  perform¬ 
ance  results  were  then  plotted  in  terms  of  "hit''  versus 
"false  alarm"  scores.  These  two  dependent  variable 
scores  were  then  subjected  to  multiple  correlation 
analysis  with  the  six  Independent  variables  of  training 
and  experience  mentioned  previously.  Unfortunately, 
training  and  experience  data  for  seven  of  the  37  sub¬ 
jects  were  not  available.  For  the  remaining  30,  the 
analysis  results  showed  an  absence  of  significant 
correlation  between  performance  and  experience. 

The  data  for  the  30  remaining  subjects  were  then 
divided  into  two  groups  labeled  "unbiased"  and 
"biased”.  The  criterion  for  categorization  is  evident 
in  the  differences  between  figures  3  and  4.  Figure  3 
shows  the  aggregated  classification  performance  for  a 
typical  "unbiased”  sonar  operator.  The  point  labeled 
number  1  In  figure  3  represents  the  average  performance 
for  the  first  "ping"  across  all  100  test  items  for  that 
subject,  point  number  2  for  the  second  “ping",  and  so 
forth.  Of  the  30  subjects  for  whom  experience  data 
were  available,  10  showed  "ui  biased"  characteristics 
similar  to  figure  3  while  20  showed  "biased"  character¬ 
istics  similar  to  figure  4.  The  term  "biased"  Is  used 
here  to  indicate  the  fact  that  operators  exhibiting 
this  characteristic  almost  always  reported  "nonsub"  on 
the  first  "ping";  l.e.,  they  would  almost  always  tend 
to  push  buttons  0  through  4  on  the  first  "ping".  In 
addition,  they  exhibited  a  delay  of  between  5  and  IS 
"pings"  before  their  bias  disappeared;  l.e.,  they 
seemed  to  require  an  accumulation  of  between  5  and  IS 
"pings"  worth  of  sonar  information  to  overcome  their 
bias. 

At  first  this  was  believed  to  be  the  normal  result 
of  more  extensive  experience  at  sea  on  the  part  of  the 
"biased"  operators.  Since  most  at-sea  sonar  contacts 
with  active  sonars  are  indeed  found  to  be  nonsubs,  one 
would  expect  the  development  of  such  a  base  rate  bias 
over  a  period  of  long  service. 
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Figure  3.  Classification  Performance  Curve  of 
Typical  "Unbiased"  Sonar  Operator 
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Figure  4.  Classification  Performance  Curve  of 
Typical  "Biased"  Sonar  Operator 

This  hypothesis  was  then  tested  by  subdividing  the 
group  of  30  sonar  operators  lnto"blased"  and  "onbiaoed" 
subgroups  and  again  running  the  multiple  correlation 
analysis  with  the  six  experience  factors  as  before. 

The  results  of  this  analysis  are  shewn  in  figure  5. 
Here,  the  multiple  correlation  coefficient  Is  plotted 
against  the  accumulation  of  Information  In  terms  of 
number  of  "pings"  on  each  contact.  From  figure  5,  it 
la  clear  that  the  "unbiased"  subjects  (represented  by 
the  upper  curve  scores)  showed  high  and  significant 
correlation  with  experience,  regardless  of  information 
accumulation,  as  might  be  expected  of  "normal"  people. 
On  the  other  hand,  the  20  "biased"  subjects  showed  s 
low  correlation  with  experience  until  at  least  5  to  15 
"pings"  of  Information  had  been  accumulated. 


Figure  5.  Correlation  of  Experience  with  Classifica¬ 
tion  Performance  For  "Biased"  and  "Unbiased" 
Sonar  Operators 


Discussion  and  Conclusions 

From  the  foregoing  data  analysis,  several  conclu¬ 
sions  can  be  reached.  First,  "bias’  Is  not  correlated 
with  experience;  rather  It  appears  to  be  Inherent  In 
the  subject.  Secondly,  the  presence  of  bias  results  In 
minutes  of  lost  time  In  an  operational  setting;  It 
should  be  noted  that  5  to  15  "plngi"  represents  an  ad¬ 
ditional  3000  to  9000  yards  headway  made  by  an  attack¬ 
ing  submarine  before  action  Is  taken.  Thirdly,  even 
though  "biased"  operators  ace  evidently  trying  to  re¬ 
duce  their  false  alarm  rate  (see  figure  4),  the  cor¬ 
relation  results  shown  in  figure  5  indicate  that  biased 
operators  weie  just  as  biased  about  hits  as  about  false 
alarms. 


It  is  clear  that  a  deeper  understanding  of  this 
type  of  judgment  bias  and  Its  operational  Impact  is 
critical  to  a  number  of  naval  activities  Including  per¬ 
sonnel  selection  and  assignment,  command  and  control 
system  design,  and  decision  support  system  design. 

Since  passive  sonar  Is  the  primary  ocean  surveillance 
in  current  u-,e  today,  the  study  results  reported  he»"»<t! 
are  primarily  of  historical  Interest.  But  they  demon¬ 
strate  the  critical  role  that  empirical  data  can  and 
must  play  in  understanding  human-machine  interaction 
and  in  designing  for  it.  Such  data  are  needed  for 
today's  critical  Navy  decisionmaking  tasks  such  as 
those  of  the  specialized  Warfare  Commanders,  the  Com¬ 
bined  Warfare  Commander,  the  Tactical  Action  Officer, 
and  those  who  select  and/or  modify  firing  doctrine  and 
rules  of  engagement.  Such  data  will  help  us  not  only 
to  better  understand  these  decisionmaking  functions, 
but  will  also  provide  an  Improved  basis  for  l)  building 
and  validating  decision  models,  2)  designing  better 
systems,  and  3)  making  Improved  human  and  system  em¬ 
ployment  decisions. 
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Summa ry  --  Nonmetric  multidimensional  scaling 
(MDS)  techniques  were  employed  to  determine  how  sonar 
information  is  organized  and  assigned  priorities  by  the 
Submarine  Conning  Officer  (CONN).  Data  were  collected 
from  95  Submarine  Officers  with  varying  amounts  of  at- 
sea  experience.  All  types  of  information  proposed  for 
display  in  modern  sonar  systems  were  classified  by  the 
investigators  into  15  categories.  Descriptions  of  the 
categories  comprised  the  stimuli  for  the  two  tasks  the 
subjects  performed.  In  an  unconstrained  sorting  task 
subjects  sorted  the  15  stimuli  into  groups  according  to 
similarity  of  the  sonar  information  described,  to 
provide  data  for  the  MDS  analysis.  In  a  ranking  task, 
subjects  rank  ordered  the  stimuli  according  to 
importance  at  CONN.  The  MDS  analysis  provided  evidence 
that  the  officers  organize  sonar  information  in  two 
dimensions,  related  to  Information  Source  and  Informa¬ 
tion  Destlntlon,  while  the  rank  order  data  indicated 
that  most  Importance  was  attached  to  the  information 
at  the  extremes  of  these  dimensions.  Significant 
agreement  was  found  among  all  subjects,  regardless  of 
experience  level,  in  the  way  the  sonar  information  was 
p  chologically  organized  and  prioritized. 

Recent  technological  developments  have  made  the 
human-machine  Interface  increasingly  complex  in  terms  of 
the  kind  and  amount  of  information  available  and  how  it 
1b  displayed.  The  Naval  Submarine  Medical  Research 
Laboratory  has  addressed  some  of  the  resultant  problems 
in  the  design  and  operation  of  automated  Information 
systems,  in  particular,  submarine  sonar  systems.  One 
phase  of  this  project  has  been  to  identify  those  pieces 
of  sonar  information  that  are  perceived  by  the  Conning 
Officer  (at  the  "CONN"),  who  is  Immediately  in  charge 
of  ship  operations,  to  be  most  useful  in  ship  control. 
Current  hardware  makes  it  possible  to  display  any  or 
all  sonar  information  at  the  CONN,  from  raw  auditory 
data  to  refined  visual  displays  of  predicted  ships' 
positions.  Two  Important  considerations,  however,  may 
dictate  that  less  Information  be  provided  than  is 
technologically  possible.  One  of  these  is  financial, 
in  terms  of  hardware  and  software  costs.  The  other, 
which  this  research  addresses,  is  the  limitation  by 
human  information  processing  capacities,  since  many 
information  processing  theorists  consider  too  much 
information  a  source  of  performance  degradation  (e.g., 

HI). 

There  exist  a  number  of  different  approaches  to 
identifying  and  prioritizing  the  sonar  information  that 
should  be  displayed  at  CONN,  but  each  has  its  associ¬ 
ated  problems.  It  has  been  our  experience,  for 
example,  that  judgments  by  systems  engineers  frequently 
are  not  well  received  by  the  operational  forces,  and 
that  simple  polls  of  experienced  submarine  officers 
often  yield  equivocal  results.  More  meaningful 
information,  on  the  other  hand,  could  be  obtained  by 
empirical  assessment  of  alternatives  during  real  or 
simulated  operations,  but  such  an  approach  can  be 
expensive  and  time-consuming.  In  lieu  of  these 
approaches,  Zachary  (2]  has  employed  nonmetrlc  multi¬ 


dimensional  scaling  techniques  [3)-[6]  in  the  context 
of  Naval  Air  antisubmarine  warfare  (ASW)  in  prioriti¬ 
zing  decision-making  situations.  Such  techniques  have 
been  applied  in  the  present  study  to  judgments  about 
sonar  information,  to  determine  how  such  information 
is  organized  and  assigned  priorities  by  the  submarine 
Conning  Officer. 

Method 

Subjects 

Data  were  collected  from  95  Naval  Officers  in  the 
New  London  area.  In  order  of  decreasing  seniority  and 
experience,  the  sample  consisted  of  11  Commanding 
Officers,  16  Executive  Officers,  and  30  junior  men 
qualified  as  Officers  of  the  Deck,  from  eight  fast 
attack  (SSN)  and  eight  fleet  ballistic  missile  (FBM) 
submarines.  In  addition,  38  junior  officers,  who  had 
tecently  completed  the  Submarine  Officer's  Basic  Course 
at  the  Naval  Submarine  School,  participated.  This  last 
group,  in  general,  had  no  at-sea  experience. 

Stimuli 

The  various  types  of  information  available  from 
current  and  proposed  sonar  systems  were  classified  by 
the  investigators  into  15  categories,  as  listed  in 
Table  I.  Descriptions  of  these  categories  comprised 
the  15  stimuli  for  the  tasks  to  be  performed.  Discus¬ 
sions  with  sonar  instructors  indicated  that  the 
selected  categories  were  exhaustive  of  the  types  of 
sonar  information  that  could  be  presented  at  CONN. 

Each  of  the  stimuli  vas  typed  onto  a  separate  card, 
numbered  on  the  reverse  side,  to  create  the  stimulus 
deck.  A  questionnaire  administered  after  the  data 
collection  confirmed  that  the  categories  were  meaning¬ 
ful  and  that  no  Important  piece  of  information  had  been 
omitted. 

Procedure 

To  provide  data  for  the  multidimensional  scaling 
analysis,  subjects  were  first  asked  to  perform  an 
unconstrained  sorting  task,  arranging  the  stimuli  into 
as  many  or  as  few  groups  as  they  felt  necessary, 
according  to  similarity.  Tire  definition  of  similarity 
was  left  up  to  the  subject.  Cards  which  described 
similar  categories  were  to  be  placed  in  the  same  group, 
and  any  card  which  described  a  unique  category  was  to 
be  placed  by  itself.  Then,  to  provide  additional  data 
for  interpreting  the  scaling  analyses,  subjects  were 
asked  to  rank  order  the  stimuli  according  to  importance 
at  the  CONN  for  two  different  operational  missions.  The 
first  mission  assumed  an  SSN  on  an  ASW  direct  support 
patrol.  In  such  a  scenario,  own  ship  would  seek  out 
and  follow  enemy  submarines.  The  second  mission 
assumed  an  FBM  patrol  in  an  area  where  a  high  density 
of  sonar  contacts  was  expected.  In  this  scenario,  own 
ship  would  remain  in  a  designated  area  and  try  to  avoid 
detection  by  enemy  vessels.  Subjects  were  instructed 
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then  Co  mark  their  rank-ordered  lists  to  show  which 
options  were  necessary,  merely  desirable,  or  unneces- 
aary.  Alter  each  task,  subjects  recorded  their  data  on 
answer  sheets  according  to  the  code  number  on  the  back 
of  each  stimulus  caru. 

Due  to  time  constraints  in  obtaining  data  from 
these  subjects,  the  sorting  procedure  was  used  in  lieu 
of  the  pairwise  Judgment  of  similarity  often  employed 
in  this  type  of  analysis.  Data  were  usually  collecte.d 
from  small  groups  of  subjects,  such  as  one  ship's  crew, 
in  sessions  lasting  approximately  one  hour. 

Results  and  Discussion 

The  data  from  all  subjects  for  the  unconstrained 
sorting  task  were  entered  into  a  computer  program  which 
produced  a  dissimilarities  oatrlx  for  the  13  stimuli, 
assigning  values  to  the  103  pairs  of  stimuli  according 
to  the  number  of  times  subjects  placed  them  in  the  same 
group.  This  initial  procedure  thus  produced  a  proxim¬ 
ities  matrix  from  the  nominal  scale  sorting  data.  The 
resultant  matrix,  in  turn,  was  the  input  to  the  KYST-2A 
multidimensional  scaling  program  [7].  Through  this 
technique  as  employed  here,  a  configuration  of  points 
(stimuli)  in  Euclidean  space  is  constructed  by  an 
iterative  adj*  stment  process,  based  on  the  observed 
dissimilarity  between  all  pairs  of  stimuli.  Tne  final 
configuration  is  then  rotated  so  that  the  principal 
components  of  the  points  lie  along  the  coordinate  axes. 
The  object  of  this  procedure  is  to  help  determine  the 
underlying  psychological  structure  of  the  stimulus 
domain,  namely  the  various  pieces  of  sonar  Information. 


TABLE  I 

The  categories  of  sonar  information  that  comprised 
the  stimuli.  The  numbers  indicate  the  aggregate  rank 
ordering  by  importance,  and  the  partitions  indicate 
the  degree  of  necessity,  for  all  subjects. 


Rank  Kind  of  Information  Necessity 

1.  Contact  Summary  -  Ceographlc 

2.  Contact  Summary  -  Tabular 

3.  Single  Contact  Data  NECESSARY 

4.  Tactical  Aids 

5.  Own  Ship  Data 


6.  Contact's  Active  Sonar 

7.  Raw  Visual  Displays 

8.  Ocean  Acoustic  Parameters 

DESIRABLE 

9.  Ranging  Data 

10.  Classification  Aids 

11.  Environmental  Parameters 


12.  Passive  Sonar  Setup 

13.  Raw  Auditory  Signals 

14.  Active  Sonar  Setup 
13.  Sonar  Hardware  Status 


UNNECESSARY 


The  computer  analysis  was  repeated  with  10  different 
starting  configurations  to  ensure  that  the  obtained 
solution  was  a  result  of  the  stress  value  reaching  a 
global,  rather  than  a  local,  minimum. 

The  resulting  two-dimensional  solution  is 
presented  in  Figure  1 ,  with  the  number  of  dimensions 
selected  according  to  the  suggestions  given  by  Shepard 
[8],  These  Included  consideration  of  data  values  in 
the  dissimilarities  matrix,  stress  values  for  other 
dimensionalities,  and  meaningfulness  in  the  interpre¬ 
tation  of  the  axes. 

The  labeling  of  the  dimensions  in  a  multidimen¬ 
sional  scaling  configuration  is,  for  the  most  part, 
based  on  the  information  available  to  the  investigators 
about  the  set  of  stimuli  being  scaled.  Examination  of 
Figure  l  leads  us  to  believe  that,  at  least  for  the 
sorting  data  we  obtained,  the  officers  organize  sonar 
information  in  terms  of  no  more  than  two  basic 
dimensions:  data  concerning  sources  of  information,  as 
shown  along  the  vertical  axis,  and  data  related  to  the 
destination  of  that  information,  as  given  along  the 
horizontal  axis.  The  two  extremes  of  the  Information 
Source  dimension  are  delimited  by  information  from  the 
world  external  to  the  submarine.  At  one  end  are 
auditory  and  visual  displays  of  the  relatively 
unprocessed  sonar  signals  arriving  at  the  ship’s  hyrdo- 
phone  arrays,  obtained  in  passive  mode  from  noire 
generated  by  the  sot.ar  contact.  Also  here  lies  infor¬ 
mation  about  the  contact  derived  from  any  active  sonar 
transmission  the  contact  makes.  At  the  other  end  of 
the  scale  is  Information  about  the  environment  which 
bears  on  sonar  performance,  such  as  sea  state,  ocean 
depth,  and  computed  parameters  for  the  acoustic 
properties  of  the  surrounding  ocean  area.  Information 
about,  and  derived  by,  own  ship  lies  between  the  out¬ 
side  world  extrema.  Hence,  this  axis  can  be  labeled 
as  Contact  versus  Environment. 

The  Information  Destination  axis  is  concerned  with 
where  in  own  ship,  the  submariner's  inside  world,  the 
available  information  is  directed.  The  axis  is 
delimited  at  one  end  by  factors  relevant  to  the  CONN, 
which  Influence  the  maneuvering  of  own  ship:  a  table 
listing  all  contacts  and  their  classlf icatlor ,  such  as 
friendly  or  hostile,  surface  or  submerged;  a  geographic 
picture  showing  the  positions  of  contacts  in  relation 
to  own  ship;  and  displays  showing  predicted  future 
positions  of  contacts  and  the  effects  of  trial 
maneuvers.  At  the  other  end  of  the  dimension  is 
information  relevant  to  the  sonar  personnel:  the 
status  of  own  ship's  sonar  equipment  (performance 
monitoring/f ault  location)  and  the  currert  utilization 
of  the  various  pieces  of  active  and  passive  sonar 
equipment.  This  axis  can  therefore  be  labeled  in 
terms  of  Sonar  versus  CONN. 

The  data  of  all  four  groups  of  subjects  were 
aggregated  for  each  ranking  task  according  to  Impor¬ 
tance,  and  the  Kendall  coefficients  of  concordance  W 
( 9 J  were  computed  to  assess  between-Judge  agreement. 

For  both  rankings,  agreement  was  highly  significant,  as 
indicated  by  the  chi-square  test.  For  the  situation  in 
which  the  submarine  was  acting  in  an  ASW  support  role, 
a  coefficient  of  W  »  .46  was  obtained,  X2(14,  N  »  94)  - 

609.9,  £<.001.  For  the  FBM  patrol  situation,  a  co¬ 
efficient  of  W  »  .44  was  obtained,  X2(14,  [j  »  94)  . 

577.9,  £  <  .001.  “ 

The  rank  order  of  importance  for  the  15  items  of 
sonar  information  was  determined  from  the  sura  of  their 
ranks  from  all  subjects.  This  ranking  for  the  FBM 
patrol  situation  is  given  in  Table  I,  with  the  parti¬ 
tions  according  to  necessity  indicated.  The  ranking 
for  the  ASW  support  role  situation  was  identical  except 
for  a  transposition  of  items  ranked  13th  and  14th,  and 


when  both  rankings  were  combined,  the  ranking  was  as  It  may  be  noted,  however,  that  the  ranking  hv 

shown  in  the  table.  The  ordering  and  categorization  of  importance  follows,  in  some  approximate  manner,  the 
these  stimuli  according  to  their  importance  at  CONN  arrangement  of  the  stimuli  as  one  proceeds  along  the 

appears  quite  reasonable.  Those  items  deemed  necessary  Information  Destination  dimension  from  CONN  to  Sonar, 
are  exactly  those  Important  to  maneuvering  own  ship:  To  determine  if  this  unldinensional  ranking  formed  the 

the  location  of  sonar  contacts  .In  relation  to  own  ship,  underlying  basis  for  the  configuration  given  by  the 

the  classification  of  each  contact,  and  the  motion  of  KYST-2A  scaling,  the  program  was  run  again  using  the 

own  ship.  Those  items  ranked  moderate  in  importance  tank  order  as  the  starting  configuration  for  a  one- 

were  described  as  desirable,  or  nice  to  have,  but  not  dimensional  solution.  As  with  other  hypothesized  unl- 

absolutely  necessary.  These  items  appear  to  be  ones  dimensional  starting  configurations  run  previously,  the 

which  are  less  useful,  in  themselves,  to  CONN  in  stress  value  for  the  one-dimensional  solution  was  not 

operating  own  ship,  but  which  may  help  evaluate  the  improved  beyond  the  value  originally  obtained.  This 

quality  of  information  categorized  as  Necessary.  In  result  further  indicates  that  while  a  meaningful  unl- 

that  regard,  it  is  quite  unexpected  to  find  the  visual  dimensional  ordering  can  be  imposed  on  these  stimuli, 

displays  of  relatively  unprocessed  sonar  signals  to  be  the  underlying  organization  is  yet  two-dimensional.  In 
ranked  as  high  as  seventh.  This  may  indicate  a  addition,  however.  Information  Destination  is  very 

tendency  of  CONN  to  "look  over  the  shoulder"  of  those  likely  the  more  salient  of  the  two  dimensions, 
in  Sonar,  perhaps  Just  to  make  sure  Sonar  is  not 

missing  any  contacts.  Finally,  those  items  labeled  as  To  determine  if  the  four  groups  of  officers  had 

Unnecessary  are  those  concerned  with  the  operation  of  organized  or  ranked  the  sonar  information  differently, 
the  sonar  system,  generally  under  the  complete  purview  complete  separate  anal* ses  as  described  above  for  all 

of  the  Sonar  Supervisor.  subjects  were  computed  on  the  data  from  each  group.  In 

all  cases,  resjlts  indicate  that  a  two-dimensional 

The  numbers  in  Figure  1  show  this  rank  order  solution  was  most  appropriate.  The  KYST-2A  scaling 

written  beside  the  labeled  points  on  the  two-dimen-  configurations  were  very  similar  for  all  groups,  with 

slonal  scaling  configuration.  Those  four  items  ranked  the  stimulus  points  in  slightly  different  positions  in 

most  Important  to,  and  directly  concerned  with,  the  their  respective  quadrants  from  one  group  to  another, 

function  of  CONN,  are  located  together  at  the  appro-  The  one  exception  va9  that  the  Executive  Officers 

prlate  end  of  the  Infor.nation  Destination  dlmeslon.  placed  Own  Ship  Data  closer  to  the  Contact  rather  than 

These  are  followed  by  data  about  Own  Ship  and  Contact's  the  Environment  end  of  the  Information  Source  dimension. 
Active  Sonar,  both  slightly  removed  from  the  CONN 

extremum  and  relatively  distant  from  each  other,  in  the  As  indicated  by  the  significant  coefficient  of 

directions  of  the  ends  of  the  Information  Source  dimen-  concordance  given  above  for  all  subjects,  rankings 

sion.  The  next  two  items  in  importance  are  very  close  between  groups  were  also  rather  similar,  characterized 

to  the  extremes  of  the  Information  Source  dimension,  for  the  most  part  by  transpositions  of  adjacent  stimuli 

the  Raw  Visual  Displays  at  the  Contact  end,  and  Ocean  from  one  group  to  another.  The  notable  exceptions  were 

Acoustic  Parameters  at  the  Environment  end.  Those  Items  that  the  Commanding  Officers  ranked  the  Raw  Visual 

ranked  least  important  relate  to  the  sonar  equipment  and  Displays  second,  in  the  Necessary  category,  and  the 
are  placed  at  the  appropriate  end  of  the  Information  recent  graduates  of  Submarine  School  ranked  that  same 

Destination  dimension.  information  in  the  12th  position,  in  the  Unnecessary 

category.  Commanding  Officers  were  perhaps  reflecting 
the  desire  to  monitor  the  raw  data  In  order  to  confirm 


INFORMATION  DESTINATION 
Sonar  -< - CONN 


Figure  1.  The  two-dimensional  solution  for  the  KYST-2A 
sesling  analysis,  for  all  subjects.  The  numbers  indicate  the 
ranking  by  importance  for  the  15  categories  of  sonar  information. 
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Che  accuracy  of  inferences  represented  by  Che 
categories,  or  Co  be  closely  Involved  with  all  phases  of 
Che  ship's  operation.  The  lease  experienced  group,  on 
the  other  hard,  may  have  been  expressing  recently 
acquired  training  doctrine.  Between- judge  agreemeat  for 
all  groups  was  highly  significant,  with  coefficients  of 
concordance  of  U  *  .42  to  .59  obtained.  Within  groups, 
as  well,  little  difference  was  seen  in  rankings  for  the 
two  different  tactical  missions. 

When  the  rankings  were  compared  between  officers 
assigned  to  FBM  submarines  and  those  assigned  to  SSN 
ships,  again,  little  difference  was  evident  in  the  way 
the  two  groups  ranked  the  stimuli  for  the  two  opera¬ 
tional  missions,  and  the  rankings  followed  the  same 
general  pattern  as  presented  in  Table  I.  In  addition 
to  minor  reversals  in  rankings  between  the  two  groups 
of  officers,  however,  the  FBM  officers  consistently 
ranked  Own  Ship  Data  as  more  important,  third  over  all, 
than  did  the  SSN  officers,  who  ranked  it  seventh.  This 
difference  may  reflect  greater  general  concern  on  the 
part  of  the  FBM  officers  with  their  ship  being  "on 
station,"  consistent  with  the  mission  of  an  FBM  patrol. 
Similarly,  SSN  officers  ranked  Single  Contact  Data  and 
Contact's  Active  Sonar  two  positions  more  Important 
than  the  FBM  officers  did,  possibly  reflecting  consis¬ 
tency  with  the  SSN's  mission.  It  should  be  noted  that 
the  differences  between  these  groups  are  minimized  by 
the  fact  that  the  officers  could  have  had  a  varied 
range  of  experience  on  a  submarine  other  than  the  type 
to  which  they  were  currently  assigned. 

The  rankings  for  the  groups  of  FBM  and  SSN  sub¬ 
marine  officers  were  combined  with  each  other  and 
across  the  two  types  of  missions,  as  well.  When  com¬ 
pared  with  the  ranking  from  the  Submarine  School 
graduates,  the  latter  attached  more  importance  to  the 
Ocean  Acoustic  and  Environmental  Parameters  and  less 
Importance  to  Own  Ship  Data  and,  as  noted  above,  Raw 
Visual  Displays.  It  is  suggested  that  these  differ¬ 
ences  may  reflect  experience  gained  at  sea  versus  the 
aspects  of  ship  operations  emphasized  in  the  Submarine 
Officer's  Basic  Course. 

Summary  and  Conclusions 

This  study  represents  a  successful  application  of 
the  multidimensional  scaling  model,  providing  a  repre¬ 
sentation  of  the  way  in  which  various  pieces  of  sonar 
information  are  organized  In  the  mind  of  the  submarine 
Conning  Officer.  Results  Indicate  that  there  is  sub¬ 
stantial  agreement  among  officers  of  various  levels  of 
experience  regarding  the  way  the  kinds  of  sonar  infor¬ 
mation  are  organized.  There  is  also  agreement  among 
these  groups  in  the  relative  importance  of  these  pieces 
of  information  in  two  different  operational  scenarios, 
both  of  which  yielded  similar  rankings. 

At  least  for  the  data  obtained  from  unconstrained 
sorting  by  similarity,  multidimensional  scaling 
analyses  suggest  that  two  dimensions,  at  most,  are  re¬ 
quired  to  describe  the  Conning  Officers'  conceptuali¬ 
zation  of  the  relations  among  various  types  of  sonar 
Information.  One  dimension  is  related  to  the  source  of 
available  sonar  information,  whereas  the  orthogonal, 
and  primary,  dimension  relates  to  where  in  own  ship 
that  information  is  directed  or  handled.  The  former 
dimension  is  laid  out  according  to  information  from  the 
sonar  contact,  from  own  3hip,  and  from  the  ocean  envir¬ 
onment.  The  primary  dimension  involves  sonar  opera¬ 
tions  at  one  end  and  Conning  Officer's  responsibilities 
at  the  other. 

When  ranked  according  to  importance,  the  informa¬ 
tion  that  the  officers  appear  to  require  most  is  that 
from  the  extremes  of  the  dimensional  axes,  except  for 


that  information  directly  concerned  with  the  sonar 
hardware.  For  system  design,  these  results  suggest 
that  data  about  the  sonar  system  are  least  desired  at 
CONN  and  hence  could  be  omitted  from  the  CONN's  display 
if  financial  or  information  processing  limitations 
dictate  that  ail  information  should  not  be  made  avail¬ 
able.  If  further  restriction  of  kinds  of  data  to  be 
displayed  at  CONN  were  necessary,  investigation  of 
those  types  of  information  in  closest  proximity  to  each 
other  in  the  multidimensional  scaling  solution,  indica¬ 
ting  highly  similar  data,  would  be  appropriate  to 
determine  if  there  are  any  completely  redundant  displays. 
An  hierarchical  clustering  analysis  is  underway  to 
assess  this  redundancy. 

Acknowledgments 

The  authors  wish  to  express  their  gratitude  to  Dr. 
Thomas  Santoro  for  his  computer  assistance,  and  to  Dr. 
Phipps  Arable  for  his  most  helpful  consultations  on 
various  aspects  of  the  multidimensional  scaling 
analyses. 

This  research  was  performed  under  the  Naval 
Medical  Research  and  Development  Command  Research  Work 
Unit  MOlOO-PN-OOl-1015.  The  opinions  and  assertions 
contained  herein  are  those  of  the  authors  and  should 
not  be  construed  as  official  or  necessarily  reflecting 
those  of  the  Department  of  the  Navy  or  the  Naval  Sub¬ 
marine  Medical  Research  Laboratory. 

References 

[1]  I.  L.  Janls  and  L.  Mann,  Decision  making.  New 
York:  Free  Press,  1977,  p.  16. 

{2]  W.  W.  Zachary,  Application  of  multidimensional 

scaling  to  decision  situation  prioritization  and 
decision  aid  design.  Technical  Report  1366-B. 

Willow  Grove,  PA:  Analytics,  1980. 

13]  J.  B.  Kruskal,  "Multidimension.'.l  scaling  by  op¬ 
timizing  goodness  of  tit  to  a  nonmetric  hypothe¬ 
sis,"  Psychometrlka,  vol.  29,  pp.  1-27,  1964. 

[4]  J.  B.  Kruskal,  "Nonmetric  multidimensional  scaling: 

A  numerical  method."  Psychometrlka ,  vol.  29,  pp. 
115-129,  1964. 

[5]  R.  N.  Shepard,  "Analysis  of  proximities:  Multi¬ 
dimensional  scaling  with  an  unknown  distance 
function.  I . , "  Psychometrlka ,  vol.  27,  pp.  1 25— 

140,  1962.  . . 

[6]  R.  N.  Shepard,  "Analysis  of  proximities:  Multi¬ 
dimensional  scaling  with  an  unknown  distance 
function.  II.,"  Psychometrlka,  vol.  27,  pp.  219- 
246,  1962. 

[7]  J.  B.  Kruskal,  F.  W.  Young,  and  J.  B.  Seery,  How 
to  use  KYST-2,  a  very  flexible  program  to  do 
multidimensional  scaling  and  unfolding.  Murray 
Hill,  NJ:  Bell  Telephone  Laboratories,  1977. 

[8]  R.  N.  Shepard,  "Representation  of  structure  in 
similarity  data:  Problems  and  prospects," 
Psychometrlka ,  vol.  39,  pp.  373-421  ,  1974.. 

[9]  M.  G.  Kendall,  Rank  correlation  methods.  London: 
Griffin,  1948,  chap.  6. 


90 


U  u  wu  v 


> 

DECISION  AND  DISPLAY  ANALYSIS  IN  A  SIMPLE  SURVEILLANCE  PROBLEM 

Frank  L.  Greitzer  and  Ramon  L.  Hershman 


Navy  Personnel  Research  and  development  Center 
San  Diego,  CA  92152 


“  Summary 

/y. 

3  "  Last  year  at  this  workshop  we  reported  on  human  per- 

■f*  formance  in  a  decision  making  task  posed  in  terms  of 
target  surveillance.  The  target  was  either  at  a  pre¬ 
vious  fix  or  had  moved  to  a  new  location  offset  by  a 
known  distance  and  random  angle.  The  problem  was  to  de¬ 
cide,  based  on  a  noisy  sample  of  data,  which  of  these 
two  states  was  true.  To  assess  human  information  pro¬ 
cessing  abilities  we  derived  an  optimal  processor  to 
compare  with  human  performance  data.  In  this  report  w 
pursue  the  relationship  between  visual  and  auditory 
representations  of  the  task  and  describe  effects  of 
alternative  representations  on  human  performance.  ^ _ 

Problem  Definition 


Oetails  of  the  problem  and  its  analysis  were  de¬ 
scribed  earlier  [1].  Briefly,  we  assume  that  the  target 
is  stationary  when  the  data  are  observed  and  that  its 
position  is  either  fixed  at  an  arbitrarily  defined  ori¬ 
gin  or  exactly  R  units  removed  in  any  direction.  Let  N 
observations  be  presented  as  dots  on  a  CRT  display, 
where  each  dot  encodes  the  reported  (x,y)  coordinates. 

Let  sensor  error  be  produced  by  the  circular  normal 
density.  Then  if  the  target  hasn't  moved,  the  dots  will 
tend  to  cluster  around  the  origin.  If  it  has  moved,  the 
dots  tend  to  cluster  around  its  new  locus  (RcosS,  Rsine) 
where  R  is  the  fixed  offset  distance  and  0  is  taken  to 
be  a  uniform  variable  on  (0,2ir). 

Thus,  the  decision  maker  must  decide  between  only 
two  possible  states  of  nature,  SQ  and  SR.  In  state  SQ 

the  target  is  at  (0,0);  in  state  SR  it  has  moved  R  units 

away.  Figure  1  (a)  shows  some  typical  stimuli  in  the 
original  "DOTS”  representation  for  samples  of  size  7 
from  SR  (signal)  and  SQ  (noise),  respectively.  On  each _ 

trial  the  observer  sees  one  such  stimulus  or  the  other 
and  must  decide  whether  SQ  or  SR  is  true.  Stimuli 

in  Figure  1  are  other  visual  representations  of  the 
identical  data. 


Optimal  Performance 

Requiring  the  decision  maker  to  minimize  the  expect¬ 
ed  cost  of  decisions,  we  derived  the  optimal  Bayes  pro¬ 
cedure  and  the  optimum  rule  (see  [1]  for  details),  viz. ; 

Decide  SR  if  and  only  if  d  >  d*. 

where  £  is  the  observed  distance  from  the  origin  to  the 
center  of  mass  (centroid)  of  the  data;  and  the  criterion 
d*  contains  a  modified  8esse1  function  of  order  zero  and 
depends  on  the  costs,  priors,  sample  size  N,  error 
2 

variance  o  ,  and  R.  The  criterion  £*  is  a  decreasing 

function  of  N/c2  and,  for  symmetric  costs  and  priors, 
approaches  R/2  in  the  limit. 


(a)  mts 

SIGNAL 

NOISE 

0. 
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(b)  SPIKES 

& 

Q 

(c)  GUPHS 

& 

O 

W  wedges 

X 

(e)  ZAGS 
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Figure  1.  Visual  displays  used  in  the  decision 
making  experiments.  The  five  stimuli  shown  as 
"signals"  (and  those  as  “noise")  represent 
Identical  samples  of  size  7.  All  of  the  stimuli 
encode  the  same  information;  viz. ,  in  polar 
coordinates  the  encoding  Is  (r.,  e{)  represent¬ 
ing  range  and  bearing  from  the  origin. 

Explanation  of  stimuli:  (a)  DOTS--original 
spatial  representation  of  points  in  the  plane. 

(b)  SPIKES— spikes  are  drawn  at  bearings  6^ 

and  with  lengths  proportional  to  r^.  Size  of 

circle  is  arbitrary,  (c)  GLYPHS— same  as  SPIKES 
except  rays  are  used,  (d)  WEDGES— wedges  extend 
from  the  origin  to  each  sampled  point. 

(e)  ZAGS— vectors  of  length  r.  and  orientation  0^ 

are  successively  appended. 
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In  the  context  of  the  theory  of  signal  etectability 
[2]  we  regard  the  movement  of  the  target  as  a  "signal'1 
to  be  detected  and  obtain  the  probabilities  of  detection 
and  false  alarm.  These  are,  respectively,  the  probabi¬ 
lities  that  the  observe!  distance  ^  exceeds  4*,  given 
and  S,j.  Figure  2  shows  theoretical  Receiver  Operating 
Characteristic  (ROC)  cu-ves  for  sample  sizes  N  =  1,  3,  5, 
7,  and  9  with  R  =  1.2  units  and  o  =  1.  For  a  given  sam¬ 
ple  size,  a  decision  maker  who  uses  the  distance  statis¬ 
tic  d  is  constrained  to  the  given  curve. 

Hum»n  Performance 
Spatial  Representations 

We  assigned  equal  priors  and  collected  performance 
data  for  the  DOTS  task  with  sample  sizes  N  *  1,  3,  5,  7, 

2 

and  9;  o  =1.  A  microcomputer  displayed,  on  each  trial, 
the  circle  of  radius  R  =  1.2  units  (2.2  cm)  and  plotted 
the  sampled  data  as  points  in  the  plane.  We  used  both 
yes/no  and  rating-scale  procedures  (see  [2],  pp.  32-43) 
for  data  collection.  Feedback--right  or  wrorig--and  the 
correct  location  of  the  target  were  given  after  each 
play.  Three  observers  were  tested  for  approximately  500 
plays  at  each  sample  size;  the  last  200  plays  were 
analyzed.  Representative  rating-scale  ROC  curves  for 
one  observer  are  plotted  as  dashed  lines  in  Figure  2. 

We  note  that  our  observers  only  approximated  the  optimal 
procedure,  and  although  they  improved  as  N  increased, 
they  failed  to  extract  all  the  available  information. 


We  also  tested  other  visual  representations  of  the 
data.  In  this  regard,  any  information  display  consti¬ 
tutes  some  level  of  decision  aiding.  At  one  extreme 
would  be  a  display  of  numbers  for  the  x,y  coordinates. 

At  the  other  extreme  would  be  simply  showing  the  optimal 
decision.  Our  displays  in  Figure  1  lie  between  these. 

We  sought  alternative  representations  that  preserved  the 
dimensions  of  the  original  task.  Thus,  every  display 
codes  distance  and  angle  information  for  each  observa- 
tion--these  dimensions  remain  unintegrated  and  tneir 
processing  is  left  tc  the  observer.  Mathematically,  the 
task  is  identical  in  every  case.  However,  the  cognitive 
processing  required  seems  quite  different:  Shapes  of 
stimuli  lj-e  seem  more  salient  than  distances  to  cen¬ 
troids  . 

We  collected  performance  data  for  the  five  displays 
in  Figure  1  and  found  little,  if  any,  differences;  the 
ROC  curves  appear  in  Figure  3.  The  failure  to  find 
d-fferences,  we  believe,  attests  to  the  versatility  of 
human  information  processing.  Ue  note,  however,  that 
observers  had  unlimited  time,  no  stress,  and  no  addi¬ 
tional  workload  conditions— perhaps  such  manipulations 
would  yield  differences  in  performance . 

Acoustic  Representation 

As  we  noted  last  year,  the  solution  to  our  spatial 
task  is  identical  to  the  solution  of  a  well-known  pro¬ 
blem  in  acoustic  and  radar  signal  processing:  detecting 
in  Gaussian  noise  a  sinusoid  signal  known  exactly  except 


Figure  2.  Optimal  (solid  curves)  and  human  (open  circles)  ROC  curves  for  sample 
sizes  N  »  1,  3,  5,  7,  and  9;  observed  curves  are  each  based  on  200  trials. 
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Figure  3.  Observed  ROC  curves  for  the  five 
information  displays  with  M  *  5;  R  *  1.2,  a*  *  1. 
Each  curve  is  based  on  200  trials  with  a  rating- 
scale  procedure.  [0  »  DOTS,  S  *  $°IK£S, 

G  *  GLYPHS,  W  *  WEDGES,  Z  *  ZAGS] 


for  phase.  Here  the  input  is  passed  through  a  narrow- 
band  filter  tuned  to  the  signal  frequency;  the  amplitude 
of  the  resultant  envelope  is  then  submitted  to  a  cri¬ 
terion  device  to  reach  a  decision.  There  is  a  strict 
isomorphism  between  this  acoustic  problem  and  the  spa¬ 
tial  task  for  our  observers.  The  unknown  phase  of  the 
sinusoid  is  precisely  the  unknown  angle  0  of  the  tar¬ 
get's  movement  in  the  spatial  domain.  The  amplitude  of 
the  envelope  output  by  the  narrow-band  filter  is  the 
distance  statistic  i  in  our  task.  Ue  find  no  prior 
reference  :o  this  striking  isomorphism  in  either  the 
engineering  or  behavioral  science  literature.  We  were 
particularly  interested  in  comparing  human  performance 
with  different  sensory  modalities  on  these  mathematical¬ 
ly  identical  tasks. 

A  psychoac.ustic  experiment  reported  in  1964  [3] 

(see  also  [2]  and  [4])  met  the  conditions  of  our  task 
and  essentially  played  the  spatial  problem  through  the 
observer's  ears.  A  sliding  rating  scale  was  used  to 
report  the  observer's  confidence  that  the  sine  wave  was 
present  and  this  afforded  a  precise  plot  of  the  ROC 
curve.  One  of  the  conditions  in  [3]  had  acoustic  para¬ 
meters  Ej/Nq  -  '4  that  are  mathematically  equivalent 

to  a  sample  size  of  19.4  in  our  spatial  task  [5].  The 
auditory  ROC  curve  is  shown  in  Figure  4  (open  circles). 
Note  that  the  acoustic  task  was  much  easier  (in  theory) 
than  any  of  our  conditions.  However,  the  auditory  RCC 
curve  zpproximates  the  theoretical  curve  that  we  derived 
for  our  task  with  N  »  3;  it  also  is  close  to  our  ob¬ 
served  data  for  N  ■  3.  These  comparisons  indicate  a 
severe  loss  in  sensitivity  in  the  acoustic  modality. 

It  is  clear  that  humans  perform  far  better  in  the 
spatial  equivalent  of  the  detection  problem.  We  attri¬ 
bute  this  superiority  in  part  to  the  human's  limited 
ability  to  process  acoustic  phase  information  [6],  Co¬ 
herence  in  phase,  of  course,  speaks  to  the  presence  of 
the  signal— and  in  the  spatial  domain  the  variation  of 
phase  angle  for  the  N  observations  is  readily  observed. 
However,  this  information  is  not  accessible  acoustically 
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experiment.  An  observed  ROC  curve  for  a  well -practiced 
observer  with  this  visual  no-phase  condition  is  shown 
as  filled  circles  (dashed  lines)  in  Figure  4.  Note 
that  performance  in  the  visual  modality  is  still 
superior,  despite  the  removal  of  phase  information.  If 
the  only  shortcoming  in  the  auditory  task  were  an 
insensitivity  to  phase,  we  should  expect  these  empiri¬ 
cal  curves  to  coincide.  There  are  still  unaccounted 
for  processing  losses  in  the  auditory  domain. 

Some  additional  insights  Into  phase-deaf  processing 
in  this  task  are  discussed  In  the  Appendix. 


Beyond 'differences  in  phase-sensitivity,  what  other 
modality  effects  might  be  operating?  As  one  of  the 
Conference  attendees  suggested,  the  unexplained  auditory 
deficit  may  be  attributable  to  limitations  in  temporal 
integration  in  the  auditory  system.  In  the  auditory 
task  it  is  not  possible  to  present  all  of  the  data  simul¬ 
taneously  as  was  done  in  the  visual  display.  Limita¬ 
tions  of  the  auditory  system  in  integrating  information 
over  a  temporal  range  may  correspond  in  the  visual  do¬ 
main  to  the  number  of  dots  that  the  observer  can  inte¬ 
grate  with  brief  presentatir  s.  Thus,  sequential 
display  of  the  visual  data  should  further  reduce  per¬ 
formance;  whether  or  not  this  manipulation  can  account 
for  the  remaining  modality  differences  is  an  open 
question  that  merits  empirical  investigation. 

Discussion 

In  summary,  we  devised  a  simple  but  rich  surveil¬ 
lance  task  (that  could  easily  be  done  by  machine  alone) 
in  order  to  study  human  cognitive  abilities  and  limita¬ 
tions.  We  found  that,  in  the  visual  domain,  humans 
generally  (although  imperfectly)  follow  the  prescription 
of  the  ideal  observer;  and  that  human  cognitive  pro¬ 
cesses  are  quite  versatile  in  adapting  to  variations  in 
visual  display  representations.  We  identified  an  iso¬ 
morphic  acoustic  task  and  observed  that  humans  are 
markedly  inferior  in  this  modal ity— attributable  in 
large  part,  we  believe,  to  an  inability  to  process 
relevant  phase  information. 

As  we  have  seen,  auditory  processing  of  the  sur¬ 
veillance  problem  appears  to  have  inherent  deficiencies 
(phase-insensitivity)  that  may  not  be  possible  to  over¬ 
come  in  the  acoustic  domain.  By  presenting  the  infor¬ 
mation  visually,  substantial  gains  in  performance  are 
achieved.  Even  a  relatively  degenerate  visual  represen¬ 
tation  lacking  phase  information  produces  better  per¬ 
formance  than  observed  in  the  auditory  task  (Figure  4). 
The  more  complete  visual  representations  in  Figure  1 
provide  a  higher  level  of  "aiding"  that  yields  even 
better  performance.  Nevertheless,  human  performance  is 
still  not  optimal . 

Toward  understanding  the  causes  of  this  sub¬ 
optimality,  we  conducted  some  informal  experiments  that 
revealed  human  biases  in  computing  the  centroids:  Seme 
individuals  terd  to  give  undue  "weight"  to  outlying 
points,  while  others  tend  to  do  the  opposite.  We  have 
not  pursued  the  nature  of  such  biases  in  the  various 
spatial  displays  in  Figure  1.  One  expects  that  these 
stimuli  might  emphasize  different  characterist ics  of  the 
data,  and  that  they  could  be  used  to  induce  different 
biases  in  performance.  But  the  minimal  differences  in 
performance  with  these  stimuli  (Figure  3)  must  temper 
these  expectations. 

Finally,  we  are  particularly  interested  in  the  na¬ 
ture  of  the  cognitive  processes  that  arc  brought  to  bear 
on  this  generic  problem.  Our  current  f'ans  are  to  con¬ 
centrate  more  heavily  on  modeling  the  numan  operator 
(rather  than  his  optimal  counterpart)  in  an  effort  to 
reveal  these  processes  and  the  sourcas  of  their  sub¬ 
optimality.  We  also  plan  to  address  experimentally 
some  of  the  factors  underlying  the  efficacy  and  utili¬ 
zation  of  decision  aids  in  this  simple  problem  so  that 
we  may  bring  about  more  effective  implementations  of 
decision  aids  in  the  operationa1  Navy. 
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Appendix: 

"Optimal"  Phase-Deaf  Performance 


If  indeed  human  observers  are  unable  to  process 
phase  information  in  the  auditory  task,  then  it  is 
appropriate  to  compare  their  performance  with  an 
"optimal"  processor  who  is  similarly  deprived.  Such 
an  analysis  would  also  provide  the  theoretically 
optimum  performance  to  which  we  can  compare  the  visual 
no-phase  data  shown  in  Figure  4. 

One  obtains  this  theoretical  processor  in  the  same 
manner  as  in  the  original  problem  that  we  reported  in 
[1];  i.e.,  derive  the  likelihood  ratio  of  the  density 
functions  given  states  SR  and  Sg. 

Given  Sg,  the  density  function  for  each  observation 

is  circular  normal  when  phase  information  is  available. 
In  the  ohase-deaf  .ase,  only  the  range  r ,  is  given;  in 

this  case  the  observation  r.  has  the  Rayleigh  distri¬ 
bution. 


Vri)  =  (|V°2)exP(-ri/2°ZK 

where  the  "0"  in  the  subscript  denotes  state  SQ  is  given. 

The  joint  density  of  the  N  independent  observations  in 
the  Simple  is,  of  course,  the  product  of  N  such  terms. 

Given  SR,  the  range  r^  for  each  observation  has  a 
Rice  distribution, 

fR{ri)  =  (r1^)exp[-(r^)/^]I0(R<rV). 

where  Ig(.)  is  the  modified  Bessel  function  of  the 

first  kind  and  order  zero.  The  joint  density  of  the  N 
independent  observations  is  the  product  of  N  such  terms. 
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The  likelihood  ratio  is  then 
N 

fR/fQ  =  e*p(-R2/2a2)TTl0iri)/o2), 

and  the  decision  requires  a  comparison  of  its  value  for 
the  ouserved  data  with  a  constant  K  *  (crpq/CqPr)  given 

by  the  prior  probabilities  p0  and  oR  and  the  costs  cR 
(false  alarm)  and  cQ  (miss);  cf_.  [']•  Just  as  in  the 

case  of  the  phase-sensitive  processor,  this  phase- 
deprived  processor  uses  Bessel  functions  on  distances 
to  arrive  at  a  decision.  But  in  this  case  we  were 
unable  to  simplify  the  expression.  Therefore,  we  gener¬ 
ated  "optimal1'  no-phase  performance  by  computer  simula¬ 
tion  of  10,000  trials  to  arrive  at  predicted  ROC  curves. 

These  optimal  curves  are  shown  in  Figure  5  for 
sample  sizes  1,  3,  5,  10,  and  20;  and  parameters  R  -  1.2 
2 

and  o  *  1.0.  The  observed  data  from  the  visual  no¬ 
phase  task  in  Figure  4  are  re-plotted  in  Figure  5  for 
comparison  with  optimal  performance. 

What  can  be  inferred  from  the  optimal  models 
(both  with  and  without  phase)  about  the  way  that  our 
observers  process  the  information?  Cased  on  the 
theoretical  analyses,  it  is  clear  that  the  no-phase 
condition  is  computationally  more  difficult  then  the 
phase-sensitive  case.  The  phase-sensitive  processor 
merely  computes  the  sample  statistic  d  and  compares  it 
with  the  criterion  d^*  (which  depends  on  the  computation 
of  a  single  Bessel  Junction).  A  human  observer  in  the 


original  task  with  phase  information  displayed  (e.g,, 
DOTS)  does  not  actually  need  to  compute  Bessel  functions; 
it  is  sufficient  to  compute  £  and  then  base  the  decision 
on  an  estimate  of  d*  that  is  acquired  through  learning. 

On  the  other  hand,  the  products  of  Bessel  functions  that 
arise  in  the  no-phase  case  apparently  do  not  reduce  to 
such  a  simple  comparison.  It  does  not  seem  likely  that 
humans  could  perform  such  complicated  computations  in 
their  heads— in  fact,  we  found  that  a  simpler  (but  not 
opt'nal)  rule  based  only  on  the  mean  of  the  sampled 
di  .ances  does  nearly  as  well  as  the  optimal  (the  ROC 
curves  for  this  processor  nearly  coinride  with  those  of 
Figure  5).  Thus,  as  in  the  original  task,  the  human 
decision  maker  can  perform  this  task  very  adequately 
merely  by  computing  means  and  learning  through  adapta¬ 
tion  where  to  place  the  decision  criterion. 

It  is  elso  of  interest  to  compare  the  optimal  ROC 
curves  in  Figure  5  (without  phase  information)  with 
those  in  Figure  2  (with  phase  information),  hirst, 
note  that  for  N  -  1  both  theoretical  curves  coincide, 
as  we  should  expect  since  one  observation  does  not 
provide  any  useful  phase  information.  Second,  we  find 
that  the  no-phase  curve  for  H  *  20  virtually  coincides 
with  the  phase-sensitive  curve  for  N  =  7.  We  interpret 
this  to  mean  that  the  theoretical  sensitivity  for  N  =  20 
(no  phase)  is  the  same  as  that  for  N  «  7  (with  phase). 

We  might  guess  that  human  performance  curves  for  these 
two  conditions  would  also  closely  agree—and  this  is  in 
fact  the  case,  as  a  comparison  of  Figures  5  and  2 
reveals.  While  this  agreement  is  gratifying,  it  would 
be  very  useful  to  specify  a  functional  relationship  be¬ 
tween  the  ROC  curve  sensitivities  in  the  two  conditions. 


Figure  5.  ROC  curves  for  an  optimal  processor  deprived  of  phase  information; 
sample  sizes  N  »  1,  3,  5,  10,  and  20.  Each  curve  was  generated  by  simulation 
of  10,000  trials.  Plotted  points  are  for  a  human  observer  with  N  *  20  (from  Fig.  4). 
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^  SUMMARY 

y 

Eight  subjects  performed  a  probablistic  inforration 
integration  task  in  which  miltiple  cues,  and  their 
diagnostic  value  in  choosing  between  the  hypotheses. 
Primary  interest  focussed  on  the  occparison  of  digi¬ 
tal  and  analog-graphical  displays,  and  upon  sources 
of  non-optimality  in  the  Inforration  integration  pro¬ 
cess  .  Generally  the  analog  displays  proved  superior 
to  the  digital,  and  performance  was  better  when  in¬ 
formation  was  presented  at  a  more  rapid  rate.  Sub¬ 
jects  appeared  to  be  non-optlmal  in  their  treatment 
of  reliability  vs.  diagnostic! ty.  However,  departures 
from  > optimality  related  to  serial  position  effects 
and  to  the  use  of  the. appropriate  model  of  inforra- 
tionlntegrati  on  were  hot  observed.  ^ 


INIHOtWCriCN 


In  modern  oornmnd,  control,  and  comnunlcation 
(C3)  situations,  the  executive  decision  taker  is  of¬ 
ten  faced  with  the  task  of  rapidly  integrating  a 
large  number  of  sources  of  probablistic  Information 
or  cues,  that  bear  on  the  likelihood  that  one  or  more 
competing  hypotheses  my  be  correct.  The  limits  of 
human  decision  taking  in  such  situations  have  been 
aoply  documented  (Wallsten,  1980;  Slovic,  Flschhoff, 
A  Lichtenstein,  1977; .Wickens,  1983).  In  the  present 
report  we  consider  the  Impact  of  three  particular 
cognitive  limitations  in  a  simulated  C3)  scenario  in 
which  miltiple  sources  or  probablistic  information 
are  to  he  integrated  and  one  of  two  hypotheses  are  to 
be  chosen.  ! 


(1)  Sequential  presentation.  Wh-n  probablistic 
cues  are  presented  sequentially  over  time  there  are 
cwo  Important  ways  in  which  humans  have  been  found  to 
depart  from  the  optimal  mnner  of  integration,  i) 
They  have  been  found  to  give  too  mich  weight  both  to 
the  first  cues  in  the  series  (anchoring  or  primacy), 
as  well  as  to  the  final  cues  (raoency),  in  situations 
in  which  all  cues  should,  optimally  be  provided  equal 
weight.  These  are  known  as  serial  position  effects, 
ii)  In  Bayesian  decision  raking  tasks,  subjects  often 
adopt  ’^averaging  models"  of  information  Integration 
in  which  lnfomatlcn  that  weakly  supports  a  hypothe¬ 
sis  serves  to  reduce  subjective  confidence  in  that 
hypothesis,  relative  to  the  alternative.  In  fact, 
weak  evidence  should,  optimally  Increase  confidence, 
although  ty  a  lesser  degree  than  strong  evidence 
(Lopes,  1982). 

(2)  Reliability  and  dlagnosticlty.  A*  given  cue 
my  be  related  probablistically  to  a  hypothesis  in 

one  of  two  ways,  (i)  Its  diagnostic! ty  (D),  the 
probability  of  the  hypothesis  given  the  cue  value, 
may  be  less  than  cne.  A  oonpletely  nou-di  agnostic 


cue  (D  -  0)  ray  be  a  synptcm  that  is  equaly  likely 
under  each  of  two  hypotheses.  (11)  Its  reliability 
(R),  the  probability  that  the  cue  actually  has  that 
value,  given  the  observed  value,  ray  be  less  than 
one.  An  unreliable  cue  value  will  have  0  correlation 
with  the  actual  cue  it  purports  to  represent.  When 
cues  vary  in  both  D  and  R  the  optlnun  decision  maker 
should  down  weight  the  cue  valence  equivalently  for 
both.  Yet  evidence  suggests  that  humans  do  not  al¬ 
ways  do  so,  but  instead  ray  treat  probablistic  evi¬ 
dence  as  if  it  were  totally  reliable,  when  there  1s 
also  variability  in  D  (Johnson  et  al.,  1973;  Wickens, 
1983).  This  bias  will  be  referred  to  as  the  "as  if" 
heuristic. 

(3)  Display  format.  The  probablistic  values  oon  • 
ce.mlng  R  and  D  ray  be  presented  in  digital  or  anr.loe 
format.  Exprimental  data  suggest  that  precise  digit¬ 
al  readings  ray  not  be  an  effective  way  of  integrat¬ 
ing  rapid  information  in  order  to  obtain  a  "ballpark" 
analog  estimate  of  seme  value  (in  this  case  confi¬ 
dence).  The  theory  of  stimu lua/centra  1  processing/ 
response  (S-C-R)  compatibility  proposed  by  Wickens, 
Sandry,  and  Vidullch  (1983)  suggests  that  information 
requiring  analog  operations  in  working  memory  (such 
ms  the  updating  at  a  continuous  scale  of  confidence) 
will  be  best  served  by  graphlcal/plctoral  display 
formats. 


EXPERIMENT  AND  Idl'OD 

In  the  present  experiment  our  subjects  partici¬ 
pated  in  a  tactical  battlefield  scenario.  They  were 
to  Imagine  themselves  as  commanders  of  a  defensive 
unit  preparing  to  be  attacked  from  one  of  two  direc¬ 
tions,  North  or  Scxith.  In  each  decision  problem,  di¬ 
agnostic  information  concerning  the  neat  likely  di¬ 
rection  of  enemy  attack  was  provided  by  a  aeries  of 
6,  8,  or  10  intelligence  cues  (varied  between  prob¬ 
lems),  presented  at  either  a  slow  (5  sec/cue),  or 
fast  (3  sec/cue)  rate.  Each  cue  was  identified  by 
its  source  (i.e. ,  air  surveillance,  ground  topogra¬ 
phy),  its  dlagnosticlty  (from  0  -  1.0),  and  its  reli¬ 
ability  (0  •  1.0).  Problems  differed  in  terms  of  the 
net  amount  of  difference  offered  in  favor  of  the 
most  likely  hypothesis,  and  this  variable  created 
three  levels  of  support:  wg&k,  medium,  and  strong. 
Most  iaportantly,  for  half  of  the  problems,  informa¬ 
tion  was  presented  in  the  'verbal  format:  a  series  of 
cues  of  the  form  shown  in  Figure  1.  For  the  other 
half,  the  spatial  format  of  Figure  2  was  employed. 
One  particular  advantage  of  the  spatial  format,  in 
which  R  and  D  defined  the  width  and  height  of  a 
rectangle  respectively,  lay  in  the  fact  that  the 
total  worth  of  valence  of  the  cue  is  equal  to  the 
product  of  R  x  D.  That  is,  the  area  of  the  rectangu¬ 
lar  cue.  To  the  extent  that  area  in  a  cunmodity  that 
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Is  directly  perceived,  then  the  cognitively  loading 
aental  multiplication  cif  R  x  D  is  avoided  in  the  spa¬ 
tial  display.  The  subject  must  singly  integrate  are- 
»s  for  one  hypothesis  or  the  other. 


Figure  1:  Information  display  with  verbal  code  format. 

i 


Figure  2:  Information  display  with  spatial  code  format. 


Eight  Bale  subjects  performed  in  the  experiment 
over  a  period  of  two  -  1  i/2  hour  sessions.  Each 
subject  received  a  total  of  72  decision  problems. 
This  number  ms  formed  by  the  orthogonal  combination 
of  two  display  formats  x  three  levels  of  problem  size 
x  three  levels  of  net  evidence  x  two  presentation 
speeds  x  two  levels  of  cue  variability.  This  last 
variable  manipulated  the  degree  of  correlation  be¬ 
tween  R  and  D  within  a  problem,  but  will  not  be  dis¬ 
cussed  in  detail  here  Each  problem  consisted  of  an 
alternating  sequence  between  cues  in  favor  of  the 
North  and  too  Scuth.  On  half  of  the  trials  the  net 
evidence  favored  the  North  and  on  l*lf  the  South  was 
favored.  Because  the  cues  alternated,  on  half  of  the 


problems  the  favored  hypothsis  was  supported  by  the 
first  cue  and  on  half  it  was  supported  by  toe  last. 
This  difference  was  used  to  examine  serial  position 
effects. 


RESULTS 

Problem  and  display  variables.  Table  1  shews 
the  mean  accuracy  of  judgments  as  a  function  of  each 
of  the  independent  variables  examined  in  isolation. 
There  were  no  interactions  between  these  variable? 
and  hence  we  present  only  toe  row  and  column  means. 
Statistical  analysis  of  these  data  revealed  that  the 
spatial  display  gave  reliably  nore  accurate  judgments 
than  the  verbal  (T7)  «  3.5,  p  <  .01),  that  decision 
accuracy  was  lower  at  the  slower  presentation  rate 
(T7  *  4.27,  p  <  .01),  and  was  lower  with  more  vari¬ 
able  cue  values  (T7  ■  2.17,  p  <  .05). 

More  precise  information  regarding  toe  use  of  toe 
probabllstlc  information  was  provided  by  the  analysis 
of  confidence  ratings.  These  are  shown  in  Figures  3 
and  4.  The  actual  ratings  were  transformed  to  a 
range  of  0  -  20.  On  this  scale,  10  indicates  neutral 
confidence,  20  extreme  confidence  in  the  correct  hy¬ 
pothesis.  Values  less  than  10  occurred  when  subjects 
Indicated  oonfldence  of  varying  degrees  in  toe  incor¬ 
rect  hypothesis.  The  confidence  data  are  shown  in 
Figures  3  and  4.  Figure  3  depicts  the  strong  effect 
of  toe  three  levels  of  absolute  difference  In  evi¬ 
dence  on  the  confidence  rating  (F2,  14)  *25.1,  p  < 
.01).  These  (feta  are  isgortant  In  that  they  indicate 
that  subjects  performed  toe  task  appropriately,  ex¬ 
tracting  more  evidence  from  toe  data  as  more  evidence 
was  warranted.  The  importance  of  toe  linearity  in 
these  data  will  be  discussed  further  below.  Net  evi¬ 
dence  did  not  interact  with  any  of  toe  other  vari¬ 
ables. 
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Figure  3:  Effect  of  net  difference  in  evidence  be¬ 
tween  the  hypotheses  (objective  confidence) 
on  subjective  confidence. 
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Figure 4 :  Combined  effects  of  problem  size,  display 
format,  and  presentation  speed  on  subjec¬ 
tive  confidence. 


Figure  4  presents  the  combined  effects  of  three 
additional  variables  cn  the  confidence  ratings:  prob¬ 
lem  size,  display  format,  and  presentation  speed. 
The  figure  indicates  that  confidence  was  greater  at 
the  fast  speed  (Fi,7  “  5.88,  p  <  .05),  particularly 
with  the  verbal  format  (F  interaction  “6.34,  P  < 
.05).  In  addition,  the  figure  reveals  the  interaction 
between  presentation  speed  and  problem  size  (F2,7  ” 
5.97,  p  <  .01).  Speed  had  little  influence  when  the 
problems  consisted  of  6  or  8  cues,  but  when  the  prob¬ 
lems  were  long,  decision  performance  was  lowered  at 
the  slow  speed. 

Optimality  in  information  integration.  Figure  3 
reveals  that  subjects  extracted  more  diagnostic  in¬ 
formation  as  progressively  more  information  was  at 
fered.  What  nakes  these  cbta  particularly  signifi¬ 
cant  is  the  high  degree  of  linearity  between  the 
objective  and  subjective  confidence.  Through  analy¬ 
sts  and  model  fitting  described  elsewhere  (Scott 
fcWickens,  1983),  we  conclude  that  subjects  are  inte¬ 
grating  the  probablistic  information  from  the  several 
cues  in  the  optinal  nanner,  rather  than  following  the 
non-optlmal  averaging  strategy. 

A  second  analysis  was  performed  to  determine  if 
there  were  narked  serial  position  effects  in  the 
data,  indicative  of  either  anchoring  or  recency.  This 
was  done  by  oonparing  performance  on  trials  when  the 
correct  hypothesis  was  supported  by  the  first,  and  by 
the  last  cue  in  the  problem.  Neither  accuracy  nor 
confidence  differed  between  these  two  sorts  of 
trials,  allowing  us  to  conclude  that  either  serial 
postion  effects  did  not  occur,  or  if  they  did,  then 
the  primacy  and  recency  effects  perfectly  balanced 
each  other. 


Finally,  a  second  small  experiment  was  conducted, 
using  the  same  subjects,  to  determine  the  extent  to 
which  R  and  D  may  have  been  treated  asymnetrically , 
with  subjects  showing  biases  to  overweight  low  levels 
of  R,  applying  the  "as  if"  heuristic  and  treating 
those  cues  "As  if"  they  were  fully  reliable.  This  de¬ 
termination  was  acconplished  by  constructing  nro'olems 
such  that  if  the  'las  if"  heuristic  were  employed, 
subjects  would  be  induced  to  shift  confidence  toward 
the  incorrect  hypothesis,  thereby  producing  errors 


and  Icweriug  their  net  oonlldence  rating.  The  results 
of  this  second  experiment  revealed  that  subjects  sere 
in  fact  non-optimal  in  this  regard,  overweighting 
cues  with  low  levels  .<  reliability. 


DISCUSSION 

The  practical  implications  of  the  present  data 
are  two-fold.  On  the  one  hand,  the  differences  in 
display  format  suggest  an  advantage  to  the  spatial 
display  for  the  kinds  of  judgments  requested  here. 
When  information  is  presented  at  the  relatively  rapid 
rate,  characteristic  of  both  the  slow  and  fast  speed, 
decision  makers  cannot  be  expected  to  perform  the 
necessary  mental  oultiplication  on  the  numerals  of 
the  verbal  foT-mat  in  order  to  determine  the  aggregate 
evidence.  The  rectangular  analog  display  provides 
this  information  in  a  more  direct  compatible  fornat. 
The  advantage  ot  the  spatial  display  is  also  support¬ 
ed  ty  the  interaction  of  display  format  and  presenta¬ 
tion  speed  on  confidence.  When  the  display  Is  spa¬ 
tial  ,  confidence  is  unaffected  by  speed.  When  it  is 
verbal  on  the  other  hand,  confidence  is  i educed  at 
the  slower  speed,  when  the  burdens  an  working  memory 
are  enhanced. 

The  second  set  of  implications  pertains  less  to 
engineering  guiielines  in  system  design,  than  to  an 
appreciation  of  the  kinds  of  cognitive  limitations 
faced  by  the  decision  nak.er  in  the  aultl-elcment 
decision-naking  task.  Recognizing  that  the  effects 
observed  here  may  be  para  .gjn-specific,  these  nay  be 
briefly  summarized  as  follows: 


1)  Information  integration  is  limited  by  memory 
factors.  When  the  problems  are  long  (10  cues) 
and  the  display  speed  is  slow,  performance  as 
measured  by  confidence,  deteriorates. 

2)  Subjects  appear  to  provide  cues  of  low  reli¬ 
ability  with  greater  weight  than  is  opt  imp’.. 

3)  On  the  brighter  side,  subjects  appeared  to  use 
the  appropriate  model  of  information  integra¬ 
tion,  eschewing  an  "averaging  model"  in  favor 
of  one  that  sums  information  for  alternative 
hypotheses.  Also  there  is  little  evidence  in 
the  present  data  for  nonoptinal  overweighting 
of  either  early  (primacy)  or  late  (recency) 
cues  in  the  sequence. 

While  it  in  possible  in  the  long  run  that  engi¬ 
neering  corrections  may  be  implemented  to  circumvent 
those  cognitive  limitations  of  the  second  set  that 
are  encountered,  it  is  at  least  Important  in  the 
short  run  that  their  existence  be  acknowledged. 
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Table  1 

Percent  Accuracy 


Trial 

Low 

High 

Variability 

97.2% 

92.3% 

Coding 

Spatial 

97.fi% 

Verbal 

92.0% 

Time 

Slow 

93.4% 

Fast 

96.2% 

Weighted 

5-10% 

15.20% 

25-30 % 

Difference 

94.2% 

94.3% 

94.8% 

Set  Size 

6(Total) 

8(Total ) 

10(Total) 

95.3% 

96.3% 
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The  development  of  a  Generalized  Huaan-Hachlne 
Interface  la  driven  by  eonalderatlon  of  huaan 
communication  capabilities  and  lialtatlc  .a.  The  goal 
ia  to  develop  a  syatea  which  provides  aachlne 
capabilities  aiollar  to  those  required  for 
communication  among  huaan  beings.  Syatea  features 
resulting  froa  this  approach  and  incorporated  in  the 
design  include i  application  independence,  attention 
monitoring,  dynaaic  device  assignment,  huaan 
performance  monitoring,  and  natural  language 
processing.  In  addition,  a  special  data  aanagemmnt 
structure  has  been  designed.  Syatea  architecture  and 
development  progress  are  described.  _ _ 


the  moat  intuitive  model  they  can  bring  to 
communication  with  a  machinal  to  the  extent  that  the 
model  is  reflected  ( i.  e. ,  not  violated)  by  the 
machine’ a  activity,  communication  between  huaan  and 
machine  will  be  simplified,  efficient,  and  effective. 
The  goal  of  the  GXHI  program  ia  to  develop  a 
computer-based  model  of  the  structures  of  huaan 
communication. 


System  Overview 


Introduction 


There  is  little  disagreement  that  the  huaan  ia  a 
necessary  and  critical  element  in  complex  syateaa,  and 
that  poor  performance  on  hia  part  may  impair  the 
effectiveness  of  those  systems.  As  acknowledgement  of 
this,  there  has  been  e  recent,  renewed  emphasis  on 
'Human  Factors  Engineering’  and  related  issues  (such  as 
■user-friendliness* )  in  the  design  and  development  of 
new  systems.  However,  the  primary  focus  is  often 
placed  on  the  input/cutput  devices  to  be  used  in  the 
human-machine  interface  (111  moreover,  the  findings  of 
recent  psychological  research  srv  often  unknown  to  the 
applied  csmaunlty,  where  those  basic  results  would  have 
a  significant  impact  on  the  design  and  development  of 
new  systems.  *  most  distressing  csneeuwancc  of  this 
situation  ia  th.t  many  systems  are  designed  and 
developed  under  circumstances  where  there  la  rarely 
sufficient  time  or  money  to  eveluate  design  decisions, 
and  those  design  decisions  are  only  slightly  influenced 
by  empirical  results  that  have  been  obtained  under 
scientifically  valid  conditions. 

I  suggest  that,  while  the  lnput.’output  devices  are 
eseential  components  of  the  human-machine  interface, 
they  are  not  a  sufficient  uracription  of  that 
interface.  In  addition,  although  many  properties  of 
the  human-machine  interface  that  we  consider  important 
are  present  implicitly  in  the  choice  and  application  of 
devlcea,  theme  more  subtle  features  are  often  obmcuredj 
by  an  obsession  with  the  salient,  physical 
characteristic  cf  the  hardware. 


An  overview  of  the  GHHI  is  presented  in  Figure  1.  The 
interface  Itself  consist*  of  two  major  components t  an 
advanced  console,  and  software  which  is  the  computer’s 
■communication  intelligence.*  The  GHHI  is  truly  an 
interface  —  between  the  huaan  being  and  a  ‘host 
system, ■  which  is  the  *reot  of  the  system*  of  which  a 
particular  human  and  GHHI  are  only  one  subsystem.  For 
example,  the  *hoet  syatea*  might  be  a  network  of  other 
huanne  and  their  workstations,  a  data  acquisition  and 
processing  system,  or  a  hast  computer  in  the  usual 
sens*. 
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FIGURE  1 


This  paper  will  descr. he  the  design  and  development  of 
a  Generalized  Huaan-Hachine  Interface  (GHHI).  This 
work  has  asauaed,  axlomatlcally,  that  if  huaan 
communication  with  aachlnea  is  to  be  successful, 
machines  must  ’know’  something  about  huaan 
communication.  That  ia,  coaaunlcatioi  among  humane 
depends  on  the  sharing  of  common  symbols,  common 
assumptions,  and  coeaon  attributes  of  huaan  cognitive 
abllitleei  since  human  communication  is  the  ’model’  of 
communication  of  greatest  familiarity  to  humane,  it  is 


A  specific  ’advanced  console*  is  only  on*  example  of 
many  possible  configurations.  A  particular  console 
configuration  will  depend  on  the  operations  and  tasks 
to  be  performed,  the  environment  (e.g. ,  land,  air,  or 
eea),  and  the  platform  (e.g.,  tank,  living  room,  or 
helicopter).  The  ’advanced  console*  is  a  generic 
reference  to  advanced  devices.  such  as  voice 
recognizers  and  touch  sensitive  displays.  These 
devices  are  advanced  by  virtue  of  their  being  easier 
for  humane  to  use  than  the  traditional  buttons,  knobs, 
switches,  and  keyboards. 
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The  GHHI  system  architecture  is  ’generalized*  in  the 
»*rv»e,  that  it  neither  depends  os  nor  incorporates  any 


1 


features  of  •  a  pacific  application.  Thin  la 
accoapllahad  through  a  coaplata  aaparatlon  of  tha 
atructuraa  of  tha  ayataa  (l.a. ,  tha  coda)  and  tha  data 
required  by  a  particular  ayataa.  This  design  feature 
la  not  a  alapla  consequence  of  adopting  good 
prograaalng  techniques.  Rather,  it  raaulta  qulta 
naturally  froa  our  uttaapt  to  nodal  huaan  abllltlaa, 
where  aany  cognltlva  attrlbutaa  and  akllla  do  not 
changa  aa  a  function  of  tha  lnforaatlon  baing 
aanlpulatad. 

Of  couraa,  for  thla  generalized  HHI  to  ba  uaad  In  a 
particular  ayataa  It  auat  hava  accusa  to  tha  ralavant 
data  and  conatructa  of  that  ayataa.  (By  analogy,  ona 
auat  know  about  ‘eater*,  ' movement",  and  ‘key*,  for  a 
dlacuaalon  of  aualc  aa  oppoaad  to  ‘rhyme  achaea*, 
•verse*,  'aataphor*,  and  ‘alliteration*  fir  a 
dlacuaalon  of  poatry. )  Such  lnforaatlon  la  aada  known 
to  tha  GHHI  through  a  process  callad  ‘tailoring*,  aa 
ahoan  in  Flgura  2.  Tha  application-apacific  faaturaa 
introducad  ara  tha  data  uaad  by  tha  GHHI  coaponanta 
(dlacuaaad  below),  and  tha  conaola  daclon.  vhich 
conalata  of  tha  choica  and  arrangaaant  of  Input /output 
davicaa.  Aa  a  byproduct,  thia  archltacturally-drivan 
approach  to  tha  davalopaant  of  an  application-apacific 
HHI  allova  aconoay  in  tha  coat  and  apaad  of  nav  ayataa 
davalopaant  by  having  tha  *haad-atart*  of  a  coaaon, 
baaic  ayataa. 


FIGURE  2 


GHHI  Coaponanta 

Tha  coaputar  coaponanta  of  tha  GHHI  vara  daaignad  to 
addraaa  apaclflc  huaan  capabllitloa  and  faaturaa;  tha 
corraaponding  huaan  and  coaputar  t  ilta  ara  ahovn  in 
Figure  3.  Tha  fifth  aajor  c  onant,  the  Data  Baae 
Interpreter  and  ita  aaaoclated  Data  Baae,  vaa  not 
dictated  prlaarlly  by  conalderatlon  of  huaan 
capabilltlea.  Rather,  It  allova  ua  to  preaerve  the 
application  Independence  of  tha  ayataa.  Theae  five 
aajor  coaponanta  are  deacribed  belov. 
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Hatural  Lanauaae/Hatural  Language  Processing 

Hatural  It  .guage  (21  la  a  prlaary.  If  not  tha  aoat 
laportant  aathod  of  huaan  coaaunicatlon.  Aa  tha 
concapta  to  ba  ‘dlacuaaad*  by  huaan  and  aachine  hava 
becoae  aora  coaplax,  and  coaputar  uaa  by  the  general 
population  haa  increased,  tha  nacaaalty  for  natural 
language  proceaaing  capabilltlea  in  coaputera  haa 
lncraaaad.  Tha  natural  language  capabilltlea  of 
huaana,  hovavar,  are  etill  far  iron  baing  fully 
understood.  Consequently,  natural  language  processors 
for  coaputera  are  aarginally  aucceaaful. 

In  developing  natural  language  proceaaing  aa  a  GHHI 
coaponant  ve  vera  concerned  prlaarlly  vith  lta 
inclusion  aa  ona  faatura  of  ‘coaaunicatlon 
Intelligence; •  tha  proceaaor  we've  designed  la  not 
coaparabla  to  those  developed  in  laboratories  vhere 
natural  language  proceaaing,  per  aa.  haa  been  tha 
prlaary  focua.  [31  Rathar,  tha  proceaaor  allova  a 
certain  aaount  of  flexibility  in  tha  vay  a  huaan  can 
laaua  raquaata  or  enter  data  to  tha  ayataa. 

Tha  Hatural  Lanauaga  Proceaaor  (HIP)  la  a  special 
coaponant  of  tha  aora  general  Decode  nodule  vhich 
interprets  "11  huaan  inputs  to  the  ayataa.  Tha 
translations  aada  by  tha  HLP  ara  determined  by  the 
graaaar  vhich  va  designed.  Thia  graaaar  handles 
quarlaa  about  tha  STATE  (43  of  an  OBJECT’S  ATTRIBUTE; 
tha  comparison  between  tha  STATEs  of  tvo  OBJECTS,  or 
between  an  OBJECT'S  STATE  and  sons  specified  VALUE;  tha 
CQKDITIOHALs,  IF,  HHEH,  HHEHEVER;  and  tha  entry  of  an 
OBJECT'S  VALUE.  The  processing  conalata  of  ‘keyword* 
identification  (l.a.,  lexical  analysis),  syntactical 
analysis.  In  which  STATEHEHT  CLAUSES  ara  synthesized  by 
alapla  left-to-rlght  processing  rules,  and  aeaantlc 
analysis,  in  which  features  Ilka  THIHG-ADJECTIVE  and 
OBJECT-STATE  agraaaant  ara  assessed. 


Attention  Sharing  and  Svltchlno/Attentlon  Honitorlno 
and  Switching 

Whether  va  consider  coaaunicatlon  between  tvo  people, 
or  private  thought,  it  la  rarely,  if  ever,  true  that  a 
‘topic*  la  axhauetad  with  no  intervening  digressions  or 
diversions  ia  other  ‘topica.  *  Intuitively,  such 
aultiplexlng  aaong  tc,ica  eppears  to  ba  a  useful, 
necessary,  and  possibly  unavoldabia  feature  of  huaan 
thought,  resulting,  at  tlaaa,  in  craativa  insights 
drawn  froa  consideration  of  seealngly  unrelated  ideas. 

In  performing  tasks,  either  alone  or  vith  others, 
switching  aaong  topics  ia  often  necessary  to  understand 
a  particular  point,  or  la  dictated  by  external  events 
vhich  pust  ba  attended  to  aa  they  occur.  Tha  switch 
froa  one  topic  to  another  is  rarely  aignalled  by  an 
explicit  indication,  and  usually  happens  with  no  loss 
of  general  continuity.  Horeover,  a  return  to  a 
deferred  topic  can  often  ba  accoapllahad  vithout  a 
aajor  recapitulation  of  completed  taaka  or  already 
dlacuaaad  ideas;  that  is,  tha  topic  can  ba  continued  at 
tha  point  vhara  it  vaa  suspended. 

A  couplet*  operation  on  soae  ayataa  will  often  conuiat 
of  aany  acre  tasks  than  can  reasonably  ba  active  at  any 
particular  aoaent.  On  the  ona  hand,  all  taaka  ara  net 
needed  at  all  tlaaa;  on  tha  other  hand,  there  ara 
Halts  to  a  huaan* a  multiplexing  abilities,  thus  asking 
-t>  at  best,  superfluous  and,  at  vorst,  confusing  to 
have  norm  tasks  active  than  ha  can  attend  to.  Thus,  at 
any  instant  in  an  operation,  only  a  aubsat  of  all  tasks 
should  ba  active;  hovavar,  tha  number  of  tasks  and 
their  idantltiaa  cannot  be  defined  a  priori  since  the 
particular  subset  la  dictated  by  external  events,  and 
tha  chosen  activities  of  a  particular  individual  (e.g., 
an  operator).  (31 
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Th*  Attention  Monitor  hold*  the  definition*  of  all 
TOPIC*  of  an  operation.  These  definitions  consist  of 
ail  TASK*  of  a  TOPIC  and  the  ordering  constraints  for 
their  execution;  the  order  operators  specify 
coapietion,  sequencing  and  repetition  requireaents  for 
each  TASK  <and  subsets  of  TASK*)  within  a  TOPIC. 

When  the  Attention  Konitor  receives  an  input  (either 
froa  the  huaan  or  the  host  systea)  it  deteraines  which 
TOPIC  the  input  is  associated  with.  If  it  is 
identified  as  a  TASK  of  an  ACTIVE  TOPIC,  checks  for 
sequence  and  data  range  liait  errors  (if  applicable) 
are  aade;  if  no  errors  are  detected,  the  input,  along 
with  a  TOPIC  IDENTIFIER,  is  passed  to  the  Data  Base 
Interpreter.  If  an  error  is  detected,  the  input  is 
passed  to  the  Huaan  Perormance  Monitor. 

If  the  input  is  not  part  of  an  ACTIVE  TOPIC,  it  is  a 
TASK  of  a  NEW  or  DEFERRED  (i.e. ,  previously  active, 
presently  suspended)  TOPIC.  The  Attention  Monitor 
identifies  the  TOPIC  and  attempts  to  activate  it  by 
issuing  s  request  to  th*  Dvnaaic  Device  Assignment 
(DDA)  aodule.  That  is,  whether  or  not  the  TOPIC  will 
be  activated  depends  prisarily  on  the  availability  of  a 
device  for  that  TOPIC.  The  necessary  evaluations  are 
aade  by  the  DDA  aodule.  If  DDA  determines  that  the 
TOPIC  can  be  initiated,  the  sequencing  structure  (a 
data  array)  for  that  TOPIC  is  integrated  into  the 
Attention  Monitor's  dynaaic  T0PIC  pool. 

Intersensory  Equivalence/Dynaalc  Device  Asslqnaent 

The  tera  ’intersensory  equivalence*  is  a  general 
reference  to  the  huaan  ability  to  represent  the  saae 
inforaation  in  a  number  of  different  says. 
Specifically,  the  representations  may  require  the  use 
of  different  sensory  sodalities.  For  example,  one  may 
speak  a  person’s  name,  which  requires  the  use  of  th* 
speaker's  vocal-linguistic  apparatus  and  the  listener ‘a 
auditory  systea,  or  write  it  on  a  blackboard,  which 
requires  actor  production  and  visual  processing 
systeas,  and  although  th*  physical  representations  are 
quite  different,  th*  meaning  derived  in  both  cases  is 
th*  saae. 

This  *aulti-aodal*  representation  capability  is  used 
frequently,  particularly  when  people  are  having 
problems  communicating  some  idea  <e. g. ,  'Can  you  draw 
me  a  picture??*).  Often  we  are  forced  to  use  a  less 
than  optimal  representation  because  the  modality  of 
choice  is  not  available  under  certain  circumstances 
(*. g. ,  when  we’re  forced  to  use  the  telephone  to  give 
directions  to  some  location).  In  general,  while  it  aay 
be  the  case  that  certain  representations  are  better  or 
worse  for  particular  kinds  of  intormation,  it  is 
certainly  not  the  case  that  there  it  a  single  modality 
(or  a  single  representation  within  a  modality >  lor  the 
coaaunication  of  a  specific  piece  of  information. 

This  huaan  capability  of  intersensory  equivalence 
iaplies,  first  cf  all,  that  alternative  methods  of 
presenting  th™  3ame  inforaation  should  be  a  machine 
capability;  also,  it  should  be  possible  lor  the  machine 
to  know  when  the  huaan  has  communicated  the  same 
information,  even  when  different  data  representations 
have  been  used.  When  this  concept  is  translated  into  a 
human-machine  interface  design  it  means  that  different 
devices  (e. g.,  a  display,  voice  synthesizer,  etc.)  must 
be  capable  of  encoding  or  decoding  the  same  piece  of 
inforaation. 

In  fact,  such  flexibility  in  a  human-aachin*  interface 
is  essentially  dictated  by  two  concurrent  developments; 
the  transition  froa  dedicated  devices  to  general 
purpose  devices,  and  the  growing  complexity  of 
semi-automated  operations.  In  practice,  this  means 
that  a  particular  task  may  not  always  have  access  to 


th*  device  of  choice  if  that  device  is  being  used  by 
soae  other,  possibly  higher  priority,  task. 

Th*  Dynamic  Device  Assignment  (DDA)  component  of  th* 
GHHI  is  responsible  for  generating  the  ’optimal* 
configuration  of  TQPIC-to-(Tlrtual)D*vice  asslgnaenta 
at  each  point  in  an  operation  when  th*  set  of  active 
TOPICS  changes.  A  virtual  device  aay  be  identical  tc  a 
physical  device  (e.g. ,  a  display  unit),  soae  portion  of 
a  physical  device  (e.g.,  a  quadrant  of  a  display),  or 
to  a  cluster  of  portions  of  physical  devices  (e.g.,  two 
quadrants  of  a  display  and  a  voice  synthesizer).  For 
each  TOPIC,  we  specify  th*  set  of  virtual,  devices  which 
can  be  used  for  display  and  control  of  the  inforaation 
of  that  TOPIC,  along  with  the  ’suitability  metric’  for 
th*  us*  of  each  virtual  device. 

The  DDA  configuration  processing  is  initiated  by  a 
request  froa  the  Attention  Konitor  to  activate  a 
previously  inactive  (i.e.,  new  or  deferred)  TOPIC.  The 
resulting  configuration  depends  on  (1)  the  number  and 
identity  of  active  and  to-be-activated  TOPICS,  (2)  th* 
TOPIC  priorities,  (3)  the  available  virtual  devices, 
(4)  th*  TOPIC- to-device  suitability  aetrics,  and  (5) 
the  ’transition  penalty*  for  aovlng  an  already  active 
TOPIC  from  one  virtual  device  to  another. 

This  DDA  processing  aay  result  in  an  active  TOPIC  being 
deferred,  or  in  the  request  for  activation  of  a  new 
TOPIC  being  blocked.  The  Attention  Monitor  is  informed 
of  any  changes  in  th*  set  of  active  TOPIC*  and  makes 
all  necessary  aodifications  to  its  TOPIC  array.  If  any 
configuration  changes  are  required  as  the  result  of  DDA 
processing,  this  information  is  sent  to  the  devices, 
where  the  new  configuration  is  ispleaented. 

Human  Error/Human  Performance  Monitoring 

Noise  is  an  Inherent  property  of  all  physical  systeas. 
In  particular,  noise  in  the  huaan  systea,  which 
soaetiaes  results  in  human  error,  is  an  inescapable 
fact  which  aay  be  reduced  through  training,  education, 
or  high  aotivation,  but  not  eliainated.  Specifically, 
it  can  never  be  designrl  out  of  a  systea  without  fully 
automating  that  systea. 

The  danger  of  huaan  error  in  existing  systeas  results 
froa  the  high  credibility  granted  to  any  huaan  input. 
Those  inputs  are  propagated  through  systems,  and,  if 
they’re  inaccurate,  the  results  may  he  disastrous 
unless  some  human  detects  and  corrects  them  before 
their  full  impact  is  felt.  Of  course,  if  the  correct 
inputs  were  known,  a  priori,  the  process  could  be 
automated  and  th*  requlreaent  for  the  huaan  eliainated. 
Most  of  the  time,  however,  the  huaan  role  in  a  systea 
is  to  perform  those  tasks  which  we  don’t  know  how  to 
automate.  This  presents  a  dileasa:  How  can  a  system 
detect  an  error  if  it  doesn't  know  what  the  correct 
input  should  be? 

A  related  issue  concerns  how  well  the  huaan  should  (or 
must)  perform  m  systeas.  Systea  designers  usually 
specify  worst  case  limits  on  huaan  task  performance, 
such  as  aaximua  time  allowed  for  a  particular  action  or 
RMS  erroi  lor  soae  entry.  These  values  are  often 
arrived  at  by  an  arithmetic  manipulation  which  takes 
the  total  system  throughput  requirement,  subtracts  the 
cont.  ibution  of  the  non-huaan  (i.e..,  designed) 
components,  and  specifies  the  rvsidue  as  the  tolerable 
limits  on  human  performance.  ,  While  some  comparison  of 
human  requirements  and  human  capabilities  is  made  to 
ascertain  whether  the  human  will  be  able  to  perform  as 
specified,  the  analyses  are  most  often  superficial.  IS) 

There  are  a  number  of  issues  to  be  considered  if  the 
OHMI  is  to  reduce  the  frequency  of  huaan  errors. 
First,  the  human  component  Is  not  a  system  module 
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designed  by  huaan  brings)  thus,  sny  predictions  o f 
hussn  performance  sust  cos*  from  aodsls  derived  from 
observation  of  huaan  behavior.  A  particular  huaan's 
perforaance  can  be  judged  as  good  or  bad  only  vith 
respect  to  normative  humm.  perforaance;  it  can  only  be 
said  to  be  desirable  or  undesirable  with  respect  to  any 
other,  arbitrarily  specified,  design  requirement. 

Second,  aany  husan  errors  are  not  content  errors;  delay 
tine  in  reaction  to  events,  aultiple  corrections  to  a 
single  input,  and  total  tiae  to  construct  an  input  are 
all  associated  vith  poor  perforaance  and  could  serve  as 
data  in  the  assesaaent  of  perforaance. 

Third,  any  aeaaure  of  huaan  perforaance  is  a  (possibly 
n-dlaenslonal)  randoa  variable;  therefore,  decisions 
about  perforaance  auat  be  based  on  statistical 
inference. 

Fourth,  the  errors  vhich  can  be  detected  by  the  aachine 
auat  be  caught  at  a  point  in  the  Systea  vhich  is  near 
to  the  huaan  vho  Bade  thea,  before  they  are  passed  on 
to  the  systea  at  large  as  a  huaan  input;  aoreover,  an 
atteapt  to  correct  these  errors  should  be  sade  ut  the 
same  local  level  as  their  detection. 

Fifth,  huaan  perforaance  monitoring  having  features  of 
the  preceding  four  points  Bust  be  a  continuous  and 
(near)  real-tiae  HHI  capability.  Individual  huaan 
perforaance  vill  vary  as  a  function  of  task,  day,  and 
tiae  of  day;  perforaance  evaluation  on  any  crude  tiae 
schedule  will  do  little  to  reduce  systea  failures 
attributable  to  huaan  error. 

A  Huaan  Perforaance  Monitor  (HPH)  having  the  properties 
described  above  is  suaaarized  in  Figure  4.  At  present, 
the  HPH  coaponent  implemented  in  the  GHHI  ayatea  has  no 
assesaaent  capability  and  only  ainiaal  aeasureaent  and 
feedback  abilities.  Design  vork  for  the  coaplete 
aod"le  is  in  progress. 


Currently,  there  are  a  nuaber  of  errors  that  can  be 
detected  by  the  Natural  Language  Processor  and  the 
Attention  Monitor.  These  are  passed  to  the  HPH  nodule 
vhere  one  of  two  forma  of  feedback  ta  given  to  the 
huaan.  ’Quick  Mode*,  designed  for  experienced 
operatora  and  urgent  situations,  vaaediately  points  out 
the  operator’s  aistake  to  his.  He  is  then  directed  to 
enter  a  corrected  coaaand,  or  in  those  cases  vhere  the 
systea  can  guees  at  what  the  operator  *  ntended,  he  la 
asked  to  verify  the  syatea-corrected  coaaand.  Upon 
positive  verification,  the  coaaand  is  executed. 


’Teach  Mode*  is  based  an  the  philosophy  that  a  huaan 
who  discovers  his  own  errors  is  less  likely  to  repeat 
thea  than  one  vho  aerely  has  his  errors  pointed  out  to 
his.  Thus,  when  an  inexperienced  operator  in  a 
non-eritlcal  situation  aakes  a  aistake,  he  is  taken 
through  a  series  of  successively  acre  explicit  hints 
about  the  identity  of  the  error;  after  each  aessage  he 
is  asked  to  correct  the  error  if  he  can.  If  he  cannot 
find  his  aistake  after  all  hints  have  been  supplied, 
the  error  is  pointed  cut  and  he  is  asked  to  re-enter 
the  coaaand. 

Data  Base  Interpreter 

The  Data  Base  Interpreter  (DBI)  provides  a  mechanism 
for  storage  and  retrieval  of  information  used  by  the 
HHI  systea.  This  information  aay  coae  from  the  huaan 
or  the  host  systea,  and  aay  be  rapidly  changing, 
relatively  static,  or  even  a  constant  value,  and  nay  be 
continuously  in  deaand  or  of  only  momentary  relevance. 
The  design  of  DBI  was  motivated  by  these  attributes  of 
the  inforaation  and  its  storage  and  retrieval  needs, 
and  not  by  any  explicitly  huaan  characteristics,  as  was 
the  case  for  the  other  HHI  modules. 

Another  consideration  in  the  design  of  DBI  was  the 
philosophy  of  a  general  HHI  architecture,  demanding 
that  any  changes  required  by  a  new  application  be 
acaoaplished  by  changes  of  data,  rather  than  by 
recoding  of  the  HHI  software  itself.  As  a  result  of 
this  approach,  any  specialized  processing  required  by  a 
specific  application  bus*  be  realized  within  the 
application-specific  data  base,  and  executed  by  the 
general-purpose  Data  Base  Interpreter. 

In  the  current  iapleaentation,  this  concern  for 
separation  of  application-specific  data  from 
general-purpose  code  has  been  satisfied  by  iapleaenting 
all  application-specific  processing  in  interpretive 
instructions  which  are  stored  vith  each  data  base 
object  and  are  executed  by  DBI.  This  approach  has  the 
disadvantage  of  being  inefficient  for  high  voluae 
eoaputation  or  very  large  data  sets;  these  deaands 
could,  however,  be  satisfied  by  other  parts  of  the 
systea  outside  the  Huaan  Machine  Interface,  or  through 
execution  of  compiled  subroutines  by  DBI,  at  the 
expense  of  pure  software  generality. 

Two  aajor  advantages  of  eaploying  Interpretive 
instructions  for  application-specific  processing  are, 
first,  the  ease  vith  vhich  the  systea  can  be  enhanced 
to  satisfy  new  requirements  and  support  new 
capabilities  (e. g. ,  new  sensors  or  new  sensor 
processing  algorithas),  and  second,  the  efficiency  vith 
which  operator  tasks  can  be  reallocated  among  HHI-based 
workstations,  to  satisfy  the  anticipated  requlreaents 
of  distributed  processing  systeaa,  including 
distributed  Coaaand  and  Control.  In  the  extreme,  the 
role  of  an  individual  workstation  could  be  completely 
redefined  by  the  downloading  of  data. 

The  techniques  used  by  DBI  for  storage  and  for 
retrieval  are  analogous  to  forward-chaining  and 
backward-chaining,  respectively.  That  is,  when  new 
inforaation  is  provided  to  DBI,  it  is  stored  as  the 
value  of  the  appropriate  data  base  object,  and  the 
values  of  other  derived  objects  vill  also  be 
autaaatically  updated  if  they  are  listed  ae  RECIPIENTS 
of  the  original  object.  This  ’forward-chaining*  of 
updated  values  say  continue  to  recipient  objects 
outside  of  the  database  --  the  huaan  operator  or  the 
host  systea  --  if  one  of  thea  is  to  be  informed  of 
updated  values. 

Vhsn  retrieval  of  information  is  requested,  DBI  say 
have  the  exact  inforaation  already  in  storage,  as  a 
recipient  of  the  automatic  updating  process  described 
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above.  Mot  all  object*  are  handled  In  thla  way, 
however,  aince  aoae  object*  are  requested  such  lees 
frequently  than  their  coaponent  information  ia  updated. 
For  thus*  objects,  retrieval  requires  the  derivation  of 
the  object's  value  froa  the  stored  values  of 
lover-level  DONOR  objects.  This  ’backward-chaining* 
retrieval  of  current  values  can,  for  soae  objects, 
result  in  requests  for  indorsation  being  issued  tn  the 
huaan  or  the  host  if  on*  cf  the*  ia  the  ultiaat*  source 
of  such  inforaation.  Thus,  DB1  aanagea  the  flow  of 
indorsation  between  huaan  and  host  by  providing  both 
autoaatlc  updating  and  retrieval  of  data  upon  request. 


Figure  S  illustrates  the  architecture  vhich  ties  these 
aajor  GHHI  ‘coaaunication  Intelligence*  coaponent* 
together,  along  with  their  supporting  structures;  the 
design  is  implemented  in  Pascal. 


FIGURE  3 


An  Application 

To  test  the  HHI  algorithms,  a  human  must  interact  with 
thea.  For  that  to  be  possible,  there  must  be 
coaaunication  devices  and  tasks  for  the  huaan  to 
perfora.  We  have  designed  and  fabricated  a  prototype 
console  vhich  houses  two  touch-sensitive  high 
resolution  displays  (one  color,  one  aonochroae),  a 
keyboard,  and  a  voice  synthesizer.  This  console  serve* 
as  a  testbed  for  the  GHHI,  and  allova  us  to  investigate 
the  use  of  new  input/output  devices. 

The  operational  tasks  ve've  developed  would  be  used  in 
aost  command  and  control  situations.  At  present,  the 
land-based  scenario  includes  map  set-up,  friendl,  and 
enemy  asset  review  and  display,  and  network  message 
review.  Help  functions,  and  alphanumeric  readouts, 
vhich  provide  graphic  symbol  translation  and  status 
information,  are  also  available.  A  photograph  of  one 
interactive  screen  display  appears  in  Figure  6.  At 
this  time  in  the  operation,  the  Friendly  Assets  and 
Enemy  Assets  Review  TOPICS  have  been  activated  along 
with  the  situation  display.  The  bar  across  the  top  of 
keys  indicates  an  on-state:  in  the  main  menu  at  the 
bottom  of  the  screen,  it  indicates  the  activation  of 
topics;  in  the  different  TOPIC  quadrants,  it  has  the 
effect  of  plotting  the  designated  units  on  the 
situation  display.  171  The  Dynamic  Device  Assignment 
module  is  responsible  for  arriving  at  this  particular 
display /control  configuration  for  the  active  topics. 


The  next  TOPIC  to  be  developed  would  include  tools, 
such  as  unit  position  projection,  for  target 
correlation,  fusion  and  aggregation. 


FIGURE  6 


The  last  piece  of  the  application  is  the  'host*  in  the 
systea.  In  the  present  case,  the  host  is  a  message 
network.  A  simulation  advances  330  enemy  tanks  and 
armored  personnel  carriers  in  the  area  of  interest;  two 
line-of-sight  sensors  scan  sectors  of  the  battlefield 
and  report  number  and  position  of  sighted  targets. 
Sensor  sassages,  with  the  date-time  group  of  sighting, 
are  generated  and  sent  to  the  HHI.  Thea*  data  form  the 
basis  of  the  analysis  work  performed  by  an  operator. 

The  full  systea  is  implemented  on  two  microprocessors, 
with  the  allocation  of  processing  as  shown  in  Figure  7. 
In  the  past  v*  planned  that  the  HHI  software  would  run 
on  a  air.ole  processor;  today  w*  anticipate  that 
multiple  processors,  possibly  one  dedicated  to  each 
nodule,  will  eventually  be  necessary  to  handle  the 
high-speed  processing  requirements  of  the  systea.  We 
have  not  begun  to  explore  the  implications  of  such  a 
parallel  architecture.  The  systea  has  just  recently 
become  fully  operational.  Therefore,  there  are  no  data 
aa  yet  on  system  or  human  performance. 


GHHI 's  Role  in  Distriouted  Systems 


As  envisioned  by  sany  people,  the  architecture  of 
distributed  inforaation  processing  and  decision  systems 
will  require  that  resources  be  shared  among  individuals 
in  the  network,  that  collaborative  analysis  of 
situations  be  possible,  and  that  the  allocation  of 
responsibilities  to  a  particular  individual  be 
dynamically  defined  as  a  scenario  develops.  To  this 
point,  we  have  addressed  a  single  nod*  of  a  network, 
consisting  of  one  human  and  the  HHI  systea.  What 
information  could  the  HHI  subsystem  contribute  to  the 
*ull  network? 

The  HHI  has  eoment-to-moaent  knowledge  of  the  tasks  an 
individual  ia  engaged  -in  and  of  his  current 
performance.  In  particular,  it  would  recognize 
indications  of  overload  or  fatigue,  or  of  highly 
efficient  functioning.  Such  inforaation  on  all 
individuals  in  a  network  could  be  sent  to  an 
Independent,  higher-level  systea  module  responsible  for 
task  allocation.  With  the  addition  of  other  resident 
knowledge  (such  as  doctrine),  this  module  would  have 
the  necessary  information  to  dynamically  define  task 
assignments  for  each  workstation  *o  as  to  optisiz* 
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Condition 


total  ayataa  parforaancc.  (Kotlca  tha  siailarity,  by 
analogy,  to  th#  GHHI  Dynaalc  Davlca  Aaaignaant  a.odule; 
in  tha  distributed  ayataa  eaaa,  <n  individual  iti  a 
•davica*  and  hla  aat  of  responsibilities  la  a  'TOPIC.*' 


arc  amwCTi'.f  uucts  me. 

nr  wo  comm  mm 


FIGURE  7 


This  natvork  capability,  suaaa'lzed  In  Figura  8,  la 
only  ana  of  many  uaaful  features  of  tha  GHHI.  It  «aa 
choaan  lor  dlacuaaion  ainca  auch  a  natvork  function 
would  not  ba  raallatlc  without  tha  datallad  knovladga 
of  Individual  huaan  parforaanca  and  activity  that  can 
ba  accessed  through  tha  GHHI. 


He  hava  providad  an  introductory  ovarviav  of  tha 
Ganarallzad  Huaan*Hachina  Intarfaca  designed  and 
davalopad  in  our  laboratory.  Ha  believe  It  to  be  a 
unique  approach  to  tha  architecture  of  an  intarfaca 
between  huaan  and  Machine,  cna  which  should  facilitate 
high-level  coaaunication.  Our  prototype,  however,  is 
just  tha  beginning  of  what  will  be  a  such  longer,  but 
fascinating  endeavor  which  will  depend  on  the  evolving 
understanding  of  huaan  inforaation  processing  and  tha 
revolution  in  coaputer  technologies. 
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JA  new  generation  of  decision  aids,  based  on  person- 
lized  decision  analysis,  is  being  developed  to  support 
he  judgmental  component  of  tactical  They  structure 

and  quantify  judgments  of  uncertainty  and  value,  with  a 
view  to  enhancing  the  soundness  of  decisions  and  reducing 
the  decision  maker's  burden.  Attack  submarine  examples 
are  presented  directed  at:  assessing  target  range; 
alerting  to  dangers  and  opportunities;  and  choosing  a 
time  of  fire.  £ 

Problem:  How  to  Support  the  Judgmental 
_ Componen t  of  Tactical  C^ _ 

Tactical  command  and  control  involves  decisions  on 
how  to  respond  to  changing  awareness  of  dangers  and  op- 
port  unities,  for  example  when  to  fire  a  weapon  at  a  tar¬ 
get.  These  decisions  inevitably  involve  a  blend  of 
hard  data  (for  example  from  sensors)  and  personal  judg¬ 
ment  (say,  on  the  quality  of  sensor  output).  As  tactical 
warfare  and  the  threats  it  addresses  become  more  complex, 
the  judgmental  component  comes  under  increasing  stress, 
particularly  in  the  context  of  distributed  decision 
making.  Decision  makers  and  their  superiors  find  them¬ 
selves  in  need  of  aids  which  will  make  the  judgmental 
casks  easier  to  perform  effectively. 

Personalized  Decision  Analysis  (PDA) 
as  a  Promising  Tool 


The  Office  of  Naval  Research  has  funded  a  program 
of  applied  research  to  adapt  the  tools  of  personalized 
decision  analysis  (PDA)  for  this  purpose. [1]  FDA  is  a 
recently  flourishing  technique,  grounded  in  statistical 
decision  theory  and  engineering  psychology.  In  principle, 
it  can  be  used  to  enhance  the  quality  of  decisions,  to 
reduce  workload,  and  in  some  cases,  to  manipulate  the 
command  and  control  process.  Its  essence  is  to  quantify 
the  judgmental  elements  in  a  decision  task  end  to  display 
their  action  implications.  In  particular,  uncertainty 
is  measured  by  probability,  value  tradeoffs  arc  measured 
by  utility,  and  the  preferred  action  is  characterized  as 
the  One  with  the  highest  expected  utility. 

In  1979  a  new  effort  was  initiated  by  Engineering 
Psychology  Programs  of  ONR  through  Decision  Science  Con¬ 
sortium,  Inc.,  aimed  at  validating  the  practical  value  of 
this  concept. [2]  The  intent  was  to  foster  in  the  fleet 
the  actual  implementation  of  at  least  one  concrete  vari¬ 
ant.  This  was  both  to  provide  a  stringent  test  of  the 
applied  merit  of  the  concept  and  to  uncover  the  most  pro¬ 
mising  directions  for  further  development. 

Submarine  ASW  as  a  Testbed 

Attack  submarine  command  and  control  was  selected 
as  the  testbed  3nd  within  it  the  approach  and  attack  ASW 
scenario.  The  initial  thrust  of  the  effort  was  to  re¬ 
view  the  range  of  command  and  control  activity  in  an  at¬ 
tack  submarine  in  order  to  identify  situations  of  partic¬ 


ular  promise.  In  particular,  we  looked  for  command  judg¬ 
ments  where  there  was  significant  room  for  improvement, 
where  any  improvement  would  have  a  high  impact  on  a  mis¬ 
sion,  and  where  PDA  appeared  to  have  something  to  con¬ 
tribute.  Three  types  of  judgment  were  considered: 
assessment,  alertment,  and  action  selection.  The  situ¬ 
ation  identified  for  detailed  attention  was  approaching 
an  enemy  submarine  with  intent  to  attack,  which  included 
each  type  of  judgment.  Specifically,  we  addressed  the 
judgmental  tasks  of  assessing  target  range  (which  be¬ 
came  the  crimary  initial  focus  of  our  project),  alerting 
to  critical  situations,  and  choosing  the  time  to  fire  a 
torped  i. 

In  each  case,  the  object  was  to  explore  and  develop 
a  technology  for  aiding  submarine  commanders  and  their 
staffs  in  making  these  judgments;  specifically  to  help 
them  to  assimilate  the  data  and  the  expertise  available 
to  them  and  to  combine  them  effectively,  with  contribu¬ 
tions  from  their  own  judgment. 

Assessing  Target  Range 

The  judgmental  task  chosen  for  our  primary  effort 
was  the  assessment  of  target  range  (including  a  margin 
of  uncertainty)  in  the  light  of  multiple  conflicting 
estimates.  A  logical  algorithm  for  this  task  has  been 
developed  with  support  from  the  Mathematical  Sciences 
Division  of  ONR. [3,4]  It  takes  individual  solutions 
(estimates)  and  assessments  of  their  accuracy  (and  in¬ 
terdependencies)  and  generates  a  pooled  estimate  with 
a  band  of  uncertainty  changing  over  time  (Figure  1). 
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Figure  1.  Illustrative  Range  Pooling  Display 

The  source  of  the  accuracy  estimates  may  be  personal 
judgment  or  previous  analysis  (say,  by  fitting  para¬ 
meters  to  exercise  data).  The  algorithm,  appears  well 


106 


enough  established  now  to  permit  primary  attention  to 
be  devoted  to  user  interface  issues.  An  early  version 
of  the  algorithm  showed  substantial  improvements  over 
individual  solutions  and  over  the  "system  solution"  when 
tested  on  fleet  exercise  (Rangex)  data. [5]  Work  on 
engineering  psychology  aspects  of  eliciting  judgmental 
Inputs  and  presenting  outputs  is  being  supported  by  the 
Psychological  Sciences  Division  of  ONR,  [6,7] 

An  interactive  graphic  computer  program  has  been 
developed  based  on  the  responses  of  subjects  representa¬ 
tive  of  ultimate  users,  in  situations  which  approximate 
realistic  engagements.  Figure  1  shows  a  typical  display 
In  the  context  of  which  the  usar  can  change  the  bias  ad¬ 
justment  (shown  as  a  wiggly  bar)  or  the  80%  limits,  and 
see  displayed  the  resulting  change  in  the  pooled  assess¬ 
ment  (shown  at  right).  Figure  2  shows  another  form  of 
desired  output,  in  which  pooled  assessments  are  plotted 
against  time.  Implementation  issues  (including  inter¬ 
face  options)  have  been  proposed  for  investigation  in 
an  experimental  testbed  in  collaboration  with  a  Navy 
laboratory  (NUSC).  Subsidiary  PDA  aids  for  assessing 
error  in  specific  range  estimates  as  a  function  of 
several  sources  of  error,  have  also  been  explored. 


Time 


Figure  2.  Time-Range  Display 


of  the  target's  being  within  own  ship's  weapon  range, 
and  range  accuracy  being  within,  say,  2000  yards.  The 
aid  emits  a  signal  when  critical  thresholds  ol  probabil¬ 
ity  (which  may  be  preset  or  set  by  the  user)  are  reached 
(shown  as  broken  lines  in  display).  Assessments  from 
which  the  probabilities  are  derived  are  shown  on  the 
right  to  permit  validation  by  the  user.  (Note  that  they 
all  involve  range  accuracy,  which  would  be  output  from 
the  range  pooling  aid,  discussed  above.) 

The  time-of-fire  decision  (how  long  should  I  wait, 
attempting  to  remain  undetected,  before  launching  an 
attack?)  involves  both  uncertainty  and  value.  There  are 
the  competing  values  of  killing  the  target  and  preserving 
own  ship,  and  uncertainties  about  the  consequences  of 
any  given  course  of  action.  Timing  is  of  the  essence: 

A  premature  attack  may  both  alert  the  enemy  and  miss; 
yet  delay  increases  the  risk  that  the  SSN  will  be  detec¬ 
ted  before  attack  or  that  the  opportunity  to  attack  will 
slip  sway.  For  the  tlme-of-fire  decision  aid,  all  these 
considerations  are  put  together  into  a  "decision  tree," 
described  in  detail  in  [6J. 

Figure  4  shows  the  basic  features  of  the  t^me-of- 
fire  decision  aid.  It  predicts  thr  relative  value  of 
waiting  to  shoot  at  various  points  in  time  from  the  pre¬ 
sent,  given  a  choice  of  own  ship  maneuvers.  The  point 
at  which  relative  value  is  maximized  is  the  recommended 
time,  ail  things  considered,  at  which  shooting  has  the 
highest  expected  payo'f.  For  purposes  of  comparison, 
the  straight  horizontal  line,  marked  "Retire  now,  is 
the  value  of  simply  leaving  the  scene.  (As  the  approach 
scenarios  proceed,  of  course,  the  display  would  be  up¬ 
dated  to  reflect  new  information. )  Such  a  display  would 
have  two  functions:  to  help  decide  now  whether  or  not 
to  fire,  and,  if  the  decision  is  to  wait,  to  help  make 
preparations  for  a  future  firing. 
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Planning  Attack  and  Determining  Time-of-Fire 


In  addition  to  target  range  pooling,  other  aids 
supporting  the  time-of-fire  decision  have  been  ad¬ 
dressed — for  assessing  the  probability  of  hitting  the 
target,  alerting  to  critical  situations,  as  well  as 
actually  recommending  a  time-of-fire.  These  aids  are, 
however,  at  an  earlier  stage  of  development:  the  logical 
algorithms  have  been  identified;  the  form  of  user  inter¬ 
face  has  been  anticipated;  and  further  development  is 
being  supported  by  IBM  Federal  Systems  Division. 


Figure  3  shows  the  display  format  of  an  alerting 
aid.  It  monitors  range-related  probabilities  such  as 
being  within  a  certain  target's  counterdetection  range. 


STATUS 

CURRENT  STATUS  -  THRESHOLD  VALIDALTION 


Figure  3.  Alerting  Aid:  Output 


*  Conditional  on  Path 
0/S  Maneuver  Assumed 


Figure  4.  Attack  Planning  Aid:  Output  Display 


A  critical  issue,  currently  being  explored,  is  how 
best  to  provide  these  inputs  in  a  way  that  does  not  put 
an  unacceptable  burden  on  submarine  command  staff  in  the 
heat  of  battle.  The  intention  is  to  have  the  displays 
largely  driven  by  mere  primary  input  data  generated  rou¬ 
tinely  on-board  or  from  the  results  of  previous  analysis. 


A  Special  Role  for  PDA  in  Distributed  C 


PDA  may  have  a  particularly  promising  role  to  play 
in  distributed  tactical  systems.  Decision-analytic 
aids  can  influence  the  decisions  of  subordinate  comman¬ 


ders  in  the  system  so  as  to  balance  autonomy  from  and 
control  by  higher  authorities. 


For  example,  an  aid  to  indicate  when  a  submarine 
weapon  should  be  fired,  such  as  we  have  discussed,  would 
incorporate  assessments  of  uncertainty  (for  example. 
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probability  of  kill  and  probable  target  range)  and  value 
judgments  (for  example,  the  relative  Importance  of  kill¬ 
ing  and  avoiding  being  killed) .  The  inputs  to  the  aid 
may  be  specified  ahead  of  time  by  higher  authority  (for 
example,  value  tradeoffs  and  the  assessment  of  threat 
characteristics  and  sensor  performance) ;  or  they  can  be 
provided  on  the  spc-  b/  the  aid  user  (say,  an  assessment 
of  target  intent).  The  aid  can  be  used  centrally  by  a 
Battle  Group  commander  or  by  a  subordinate  commander. 

The  output  of  the  aid  can  be  binding  on  the  subordinate 
or  merely  suggested  guidance. 

The  degree  of  centralized  control  can  be  determined 
by  how  the  use  of  this  aid  is  specified  along  such  dimen¬ 
sions,  for  example  by  discretion  in  specifying  inputs  or 
in  acting  on  output.  That  control  can  be  changed  by  con¬ 
tingency  rules  on  the  aid's  use,  for  example  to  control 
the  subordinate  commander  tightly  under  routine  circum¬ 
stances,  but  -o  give  increased  autonomy  if  communication 
with  Battle  Group  command  should  be  broken.  Even  when 
the  quantification  and  use  of  the  aid  is  fully  dele¬ 
gated  to  a  subordinate  commander,  it  may  provide  a 
reviewable  record  of  his  decision  process,  which  may 
influence  him  to  act  in  closer  accordance  with  higher- 
level  policy  (as  rejected  in  default  inputs  to  the  aid). 

Figure  5  illustrates  how  the  aid  might  be  used  to 
affect  the  degree  of  centralization  of  control.  In  the 
decentralized  mode  (hatched  and  solid  lines)  the  sub¬ 
marine  commander  is  informed  by  the  aid  but  makes  his 
own  decision.  In  the  centralized  mode  (hatched  and 
dotted  lines)  he  is  required  to  follow  the  indication 
of  the  aid.  (Note  that  many  other  variations  are  possi¬ 
ble  by  changing  the  hatched  lines,  for  example  by  giving 
the  local  commander  a  say  in  the  value  Inputs,  or  the 
central  commander  a  say  in  the  uncertainty  inputs.) 


I'  )  44-  Both  Modes 


Figure  5.  How  Aid  Might  Affect  Centralization 


Conclusion 

We  have  argued  that  there  may  be  an  increasing 
role  for  tactical  decision  aids  which  quantify  judgment. 
The  danger,  of  course,  is  that  they  may  demand  more  Judg¬ 
ment  in  their  input  than  they  save  in  the  judgmental 
task  they  are  designed  to  aid.  This  is  why  we  attach  so 
much  Importance  to  the  human  engineering  aspects  of  aid 
development .  Solving  the  people  problems  is  more  impor¬ 
tant  than  solving  the  logic  problems. 
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Abstract 


lif  i Cr*-/i 

•Nf  suamarizeykey  features  of  an  interactive  plan¬ 
ing  and  decision  support  process  for  multiple  criteria 
Iternative  selection  situations.  Probabilities, 
;ility  scores  for  the  lowest  level  attributes,  and 
^tribute  tradeoff  weights,  i  e. ,  the  paraaeterc,  can 
.  :  imprecisely  described  by  set  inclusion.  Within  a 
jecified  structural  model  of  the  decision  situation, 
le  process  allows  the  decisionmaker  to  iteratively 
sleet  the  mix  of  parameter  value  precision  and  alter- 
itive  ranking  specificity.  By  selecting  this  mix,  the 
scisionmaker  is  able  to  direct  the  alternative  selec- 
Lon  process  in  an  interactive  manner ,  using  altema- 
Ive  selection  strategies  based  on  behaviorally  mean- 
lgful  dominance  search  strategies.  Emphasis  is  placed 
i  the  motivation  of  the  research  and  the  behavioral 
•levance  of  the  support  process.  ^r** 

1.  Introduction 


activities  as  deemed  appropriate  for  the  task  at 
hand  (1,2).  Especially  when  there  are  a  large  number 
of  alternative  courses  of  action  under  consideration, 
the  decision  process  will  typically  involve  mired 
scanning,  where  some  noncompensatory  rule  is  first  used 
to  eliminate  grossly  inappropriate  alternatives.  This 
is  then  followed  by  one  or  more  compensatory  informa¬ 
tion  evaluation  operations  that  results  in  a  dominance 
structure  which  enaoles  final  judgment  and  alternative 
selection. 

The  research  discussed  here  is  based  upon  the 
hypothesis  that  people  are  able  to  evaluate  alternative 
plans  and  decisions  efficiently  and  effectively,  and 
with  low  stress,  when  there  is  a  clear  dominance  pat¬ 
tern  among  alternatives  that  enables  establishment  of  a 
sufficiently  discriminatory  priority  structure.  Our 
goal  is  to  provide  a  knowledge  based  decision  support 
process  that  enhances  the  quality  of  the  dominance 
structure  used  for  judgment  and  choice. 


The  process  of  choosing  among  multiattri'outed 
ltematives  often  Involves  an  initial  search  for  a 
ominance  structure  and  ultiaiate  identification  of  a 
et  of  nondomlnated  alternatives.  By  definition,  a 
ondominant  alternative  is  one  which  is  not  worse  than 
ny  other  alternative  on  any  attribute  and  which  is 
etter  than  each  other  alternative  on  at  least  one 
ttribute.  In  most  decislou  situations,  however,  there 
.•  no  single  alternative  that  dominates  all  other 
lternatlves,  at  least  initially.  In  such  decision 
ituations,  the  decisionmaker  typically  "adjusts"  the 
tructure  of  the  decision  situation,  and  parameter 
slues  within  this  structure,  so  as  to  identify  a 
ominance  structure  which  contains  a  single  nondominant 
lternative.  This  search  may  involve  rational  setivi- 
lea,  such  as  aggregation  of  attributes  and  compensa- 
ory  tradeoffs  through  determination  of  judgmental 
mights.  Alternately,  it  may  involve  various  rules 
■hich  may  be  quite  flawed.  Examples  of  such  rules  are 

i)  lexicographic  ordering,  in  which  the  best  alter- 
istive  on  the  most  important  attribute  is  selected,  and 

ii)  sequential  pairwise  comparison  of  alternatives 
ising  a  preference  relation  that  is  a  function  of  the 
-wo  alternatives  being  compared.  In  this  latter  case, 
lontransitive  preferences  may  easily  result  due  to  the 
'act  that  the  contextual  relation  used  to  determine 
•references  changes  from  binary  comparison  to  binary 
romparison. 

A  variety  of  holistic,  heuristic,  and  wholistic 
judgmental  activities  will  typically  be  involved  in  the 
■eazch  for  a  dominance  structure  among  the  alterna¬ 
tives.  These  take  on  various  forms  and  mixtures  of 
formal  knowledge  based,  rule  based  or  skill  based 


The  next  section  will  present  a  summary  discussion 
of  the  features  and  structural  constructs  of  our  deci¬ 
sion  support  system.  The  following  section  presents  a 
more  detailed  discussion  of  these  structural  constructs 
and  introduces  some  of  tbe  modes  in  which  the  support 
process  can  be  used.  Then  we  discuss  some  behavioral 
issues  that  relate  to  the  conceptual  design  of  ARIADNE. 
The  list  of  references  contains  citations  to  a  number 
of  works  which  discuss  the  algorithmic  content  of  the 
decision  support  system. 

2.  Festures  of  the  Decision  Support  System 

We  now  investigate  concepts  for  the  design  and 
evaluation  of  an  interactive  knowledge  based  planning 
and  decision  support  system  which  combines,  or  allows 
combination  of,  several  evaluation  rules  and  contin¬ 
gency  structures  often  used  as  a  basis  for  evaluation, 
prioritization,  judgment,  and  choice.  We  have  de¬ 
veloped  a  knowledge  based  system  to  interactively  aid 
planning  and  decision  support  processes  through  en¬ 
couragement  of  search  for  a  dominance  structure  that  is 
behaviorally  realistic  and  rational,  from  both  a  sub¬ 
stantive  and  procedural  viewpoint.  The  support  system 
is  called  ARIADNE  for  Alternative  Ranking  Interactive 
Aid  based  on  Dominance  structural  information  Elicita¬ 
tion.  The  support  system  enables  use  of  various  inte¬ 
grated  forms  of  wholistic,  heuristic,  and  holistic 
reasoning  in  an  aided  search  for  dominance  information 
asx>ng  identified  alternatives.  We  believe  it  to  be 
flexible  enough  to  closely  match  diverse  decision 
situations  and  environments  in  order  to  support  varying 
cognitive  skills  and  decision  styles,  thereby  enabling 
planners  and  decision  sukers  to  adapt  its  use  to  their 
own  cognitive  skills,  decision  styles,  and  knowledge. 


Our  efforts  have  concerned  choice  making  situa¬ 
tions  under  certainty  and  under  risk,  priaurily  for  the 
single  decision  node  case.  This  formulation  allows 


Ihis  research  was  supported  by  the  Office  of  Naval 
teaearch  under  Contract  N00014-80-C-0542. 
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consideration  of  a  variety  of  imprecisely  known  param¬ 
eters  such  as:  attribute  tradeoff  weights,  outcome 
state  values  on  lowest  level  attributes,  event  outcome 
probabilities,  and  various  combinations  of  these. 
Parameter  needs  are  determined  from  the  structure  of 
the  decision  situa  tion,  as  elicited  from  the  decision¬ 
maker  during  the  formulation  and  analysis  steps  of  the 
decision  support  process.  We  consider  these  formula  - 
tion  and  analysis  steps  to  be  outside  the  scope  of  our 
present  software  developments  but  recognize  the  essen¬ 
tial  need  for  them  in  a  complete  decision  support 
process. 

The  decision  situation  structural  model  may  repre¬ 
sent  decisions  under  risk  or  under  certainty.  The 
attribute  tree  representing  the  features  of  decision 
outcome  states  may  be  structured  and/or  parameterized 
in  a  top-down  or  bottom-up  fashion  through  use  of 
ARIADNE.  A  single  level  structure  or  a  multiple  level 
hierarchical  structure  of  attributes  may  be  used  with 
the  choice  of  these  being  at  the  discretion  of  the 
decisionmaker.  Multiple  decision  node  situations  may 
be  approached  through  a  goal  directed  decision  struc¬ 
turing  approach  in  which  Che  growth  of  the  structure  of 
alternative  decisions  and  event  outcomes  is  guided  by 
sensitivity-like  computations  obtained  through  use  cf 
the  ARIADNE  algorithms  13-5] .  . 

Parameters  <  re  elicited  from  the  decisionmaker  in 
the  fora  of  equalities  and  inequality  bounds.  A  var¬ 
iety  of  mathematical  programming  approaches  and  graph 
theory,  have  been  used  to  generate  interactive  displays 
of  preference  digraphs.  These  mathematical  programing 
approaches  are  used  to  determine  dominance  structures 
for  alternative  prioritization  that  are  based  on  para¬ 
meter  information  elicited  from  the  decisionmaker.  At 
present,  only  a  linear  programing  approach  will  yield 
necessary  and  sufficient  conditions  for  determination 
of  a  priority  structure  and  computational  times  that 
are  consistent  with  interactive  decision  aiding.  This 
requires  that  we  elicit  structural  parameter  infor¬ 
mation  in  a  slightly  restricted  form  which  we  denote 
the  "behaviorally  consistent  information  set”  (BCIS). 
Often  this  BCIS  will  be  in  such  a  form  that  solution  of 
the  generally  nonlinear  programing  problems  associated 
with  determination  of  dominance  structures  can  be 
replaced  by  the  solution  of  simple,  computationally 
amenable  linear  programs  with  bounded  variables.  The 
major  simplification  associated  with  eliciting  param¬ 
eter  imprecision  in  a  prespecified  structural  format, 
however,  is  in  the  natural  language  dialogue  needed  to 
establish  a  model  of  the  decision  situation. 

The  purpose  of  the  graph  theory  algoritnms  is  to 
enable  construction  of  a  domination  digraph,  or  dom¬ 
inance  structural  model.  This  digraph  is  a  pictorial 
representation  of  the  ordinal  preferences  as  determined 
from  a  dominance  reachability  matrix.  This  matrix  is 
determined  by  the  linear  programming  algorithms  from 
the  decision  situation  structural  model  and  parameters 
elicited  from  the  decisionmaker.  These  domination 
digraphs  encourage  either  selection  of  a  preferred 
alternative,  or  further  iteration  using  the  aggregated 
preference  inforir*aiion  for  feedback  learning. 

An  inverse  aiding  feature  is  currently  being 
incorporated  into  the  decision  support  system  (6,71. 
This  feature  allows  the  decisionmaker  to  make  whol- 
istic,  skill  based  prioritizations  among  alternatives. 
These  prioritizations  may  be  across  some,  or  all, 
identified  alternatives,  at  the  top  level  of  the  hier¬ 
archy  of  attributes  or  at  some  intermediate  level.  If 
we  elicit  numerical  bounds  on  the  attribute  scores  for 
those  attributes  which  are  subordinate  to  and  included 
within  the  attribute  at  which  alternatives  are  prior¬ 


itized,  then  bounds  on  attribute  weights,  consistent 
with  the  wholistic  prioritization,  may  be  determined  by 
using  a  linear  programming  approach.  Alternately,  if 
weights  are  specified,  then  it  is  possible  to  determine 
bounds  on  alternative  scores  on  those  attributes  subor¬ 
dinate  to  the  attribute  at  which  prioritization  was 
made  through  use  of  linesr  programming  algorithms. 

As  slternatives  sre  identified  and  prioritized, 
updates  on  these  bounds  sre  made  available.  The  re¬ 
sults  obtained  from  using  the  inverse  aiding  feature 
are,  in  many  ways,  comparable  to  those  obtained  from 
the  regression  analysis  based  Social  Judgement  Theory 
[8] .  This  approach  provides  weight  identification 
only,  with  a  "confidence”  measurement  concerning  the 
validity  of  weights,  cardinal  preferences  are  assumed. 
Results  in  the  form  of  bounds  on,  or  ranges  of,  weights 
are  available  with  a  very  few  alternative  prioriti¬ 
zations  in  the  inverse  aiding  approach.  The  prior¬ 
itizations  needed  may  involve  a  mixture  of  cardinal  and 
ordinal  preferences.  For  a  large  number  of  prioriti¬ 
zations,  the  inverse  aiding  approach  may  become  cum¬ 
bersome  computationally  compared  to  the  regression 
based  approach,  where  additional  information  may  be 
easily  processed  in  a  sequential  fashion. 

Combination  of  inverse  and  direct  aiding  to  en¬ 
hance  decisionmaker  specification  of  imprecise  values, 
-■eights,  and  probabilities  enhances  the  usefulness  of 
ARIADNE  since  it  allows  for  judgments  and  their  expla¬ 
nation,  using  a  combination  of  formal  knowledge  based 
and  skill  based  modes.  This  enhanced  usefulness  will 
also  occur  through  encouragement  to  the  djecisionmaker 
to  become  more  aware  of  relevant  alternative  courses  of 
action  and  to  identify  new  alternatives  on  the  basis  of 
feedback  learning  of  the  impacts  of  alternatives  upon 
issues  and  objectives  in  a  behaviorally  relevant  way 
that,  hopefully,  encourages  "double-loop  learning"  [9]. 

3-  Structure  of  ARIADNE  j 

A  complete  set  of  activities  envisioned  in  using 
the  single  stage,  or  single  decision  node^  version  of 
AP  ADNE  involves  the  following  set  of  j  setivities. 

Formulation  of  the  Decision  Situation  j 

1.  Define  the  problem  or  issue  that  requires  planning 

and  decisionmaking  by  identification  j  of  its  ele¬ 
ments  in  terms  of  I 

I 

(a)  Needs,  and 

(b)  Constraints  or  bounds  on  the  issue. 

2.  Identify  a  value  system  with  which  to  evaluate 
alternative  courses  of  action,  and  identify  ob¬ 
jectives  or  attributes  of  the  outcomes  of  possible 
decisions  or  alternative  courses  of  action. 

3.  Identify  possible  alternative  courses  of  action, 
or  option  generation. 

Analysis  of  the  Decision  Situation 
1.  Determine  outcome  scenarios. 

.2.  Identify  decision  structural  model  elements,  that 

is  those  elements  or  factors  from  the  conceptual 
formulation  framework  which  appear  pertinent  for 
incorporation  into  a  decision  situation  structural 
model . 
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I.  Structure  decision  model  elemeuts: 

(a)  Structure  decision  tree, 

(b)  Structure  information  acquisition  and  pro¬ 
cessing  tree — which  may  be  part  of  the  basic 
decision  tree,  and 


step  1  of  the  Evaluation  and  Interpretation 
(C).  There  exists  many  possibilities  for 
obtaining  greater  alternative  evaluation 
specificity  such  as: 

(i)  setting  higher  aspiration  levels  or 
aspects. 


(c)  Structure  attribute  tree  or  objectives  hier¬ 
archy. 

Determine  independence  conditions  among  elements 
of  the  attribute  tree  and  decision  alternatives. 

5.  Identity  potential  for  the  use  of  deficient  infor¬ 
mation  prccessing  heuristics  and  provide  appro¬ 
priate  debiacing  procedures. 

6.  Determine  impacts  of,  or  outcomes  thay  may  result 
from,  alternative  courses  of  action. 

7.  Enccde  uncertainty  elements  in  the  form  of  event 
outcome  probabilities,  or  bounds  on  these,  to  the 
extent  possible. 

8.  Identify  risk  aversion  coefficients,  if  needed,  to 
the  extent  possible. 

9.  Identify  preference  or  value  functions,  or  bounds 
on  these  functions,  to  the  extent  possible. 

10.  Identify  attribute  weights,  or  bounds  on  these 
functions,  to  the  extent  possible. 

11.  Identify  wholistic  preferences  among  alternatives 
to  the  extent  that  this  is  possible. 

12.  Identify  possible  disjunctive  and  conjuctive 
aspects,  c.  thresholds  for  attributes,  of  iden¬ 
tified  alternative  courses  of  action. 


Evaluation  and  Interpretation  of  the  Outcome  of  Alter- 
native  Courses  of  Action 


1.  Identify  a  decision  aiding  protocol,  or  plan,  for 
evaluation  and  interpretation  of  the  decision 
situation. 


2.  Identify  potential  for  use  of  deficient  judgment 
heuristics. 


3.  Use  conjunctive  and/or  disjunctive  scanning  to 
eliminate  very  deficient  alternatives  and  retain 
alternatives  meeting  minimum  acceptability  cri¬ 
teria  across  attributes. 


4.  Determine  the  maximum  amount  ot  domination  infor¬ 
mation  possible: 

(a)  Display  domination  digraph. 

(b)  identify  alternative  courses  of  action  which 
could  not  be  among  the  N  most  preferred 
alternatives.  Normally  these  are  deleted 
from  further  consideration. 


(c)  If  the  decisionmaker  can  select  an  alterna¬ 
tive  for  implementation  by  wholistic  judg¬ 
ment,  or  prioritize  the  remaining  alternative 
set  through  heuristic  elimination,  then  go  to 
step  6  of  Evaluation  and  Interpretation. 

(d)  If  a  choice  cannot  be  made,  thea  assess 
further  information  about  values  of  impre¬ 
cisely  known  parameters  by  iterating  through 
steps  6-11  of  Analysis  (B).  then  return  to 


(ii)  moving  up  the  attribute  tree  by  deter¬ 
mination  of  a  subset  of  attribute  trade¬ 
off  weights, 

(iii)  "tightening"  bounds  on  attribute  trade¬ 
off  weights, 

(ivj  "tightening"  bounds  on  event  outcome 
probabilities,  possibly  through  infor¬ 
mation  processing  updates, 

(v)  "tightening  bounds"  on  value  or  prefer¬ 
ence  functions. 

5.  If  the  decisionmaker  has  provided  (partial)  whol¬ 
istic  preferences  as  part  of  the  analysis  effort, 
uae  these  with  the  inverse  aiding  feature  of  the 
aid  to  determine  bounds  on  sttribute  weights 
implied  by  these  preferences  such  as  to  provide 
learning  feedback  to  decisionmaker. 

6.  Conduct  sensitivity  analyais-  Provide  the  deci¬ 
sionmaker  with  an  indication  of  how  sensitive  the 
optimal  action  alternative,  or  prioritization  of 
alternatives,  is  with  respect  to  changes  in  values 
and  information  about  impacts. 

7.  Evaluate  validity  and  veracity  of  the  approach. 
Encourage  judgment  concerning  whether  the  formu¬ 
lation,  analyais,  and  interpretation  are  sound. 
If  not,  encourage  appropriate-  modification  to 
structure  and  parameters  associated  with  the 
decision  situation,  including  identification  of 
additional  attributes  and  alternative  courses  of 
action.  Then,  iterate  back  to  an  appropriate  step 
and  continue. 

In  our  work  to  date,  we  assume  that  the  details  of 
issue  formulation  and  analysis  are  accomplished  ex¬ 
ternal  to  the  interactive  aid  itself.  There  are  a 
variety  of  procedures  for  accomplishing  these  tasks. 
(10]  Our  research  assumes  that  there  exists  an  issue 
formula  ion  structure  and  that  the  impacts  of  alterna¬ 
tives  are  known.  These  are  provided  through  various 
elicitation  activities.  We  do  not  envision  that  the 
software  we  develop  for  interactive  interpretation . 
including  evaluation  and  prioritization,  will  generally 
be  suitable  for  use  independent  of  a  trained  decision 
analyst.  Whether  software  can  be  evolved  to  result  in 
an  appropriate  "stead  alone"  aid  is  very  dependent 
upon  the  environment  and  other  factors  that  constitute 
the  contingency  task  structure  for  a  specific  situa¬ 
tion.  In  situations  which  are  repetitive  and  environ¬ 
ments  which  are  stable,  such  ss  in  health  care  or 
equipment  fault  diagnosis  situations  for  example,  it 
seems  entirely  possible  to  design  useful  "stand  alone" 
aids.  In  most  strategic,  and  in  many  tactical  situa¬ 
tions  there  will  not  be  a  stable  underlying  structure 
that  will  easily  allow  this.  The  activities  involved 
in  issue  framing  and  the  identification  of  a  dominance 
structure  appropriate  for  decisionmaking  are  often  very 
situation  dependent. 

There  are  a  number  of  considerations  that  influ¬ 
ence  planning  and  decision  support  processes.  The 
person  using  a  decision  support  cyeres  should  be  aware 
of  these  considerations  if  best  use  of  the  aiding 
process  is  to  be  obtsined.  Generally  these  considera- 
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tioos  involve  the  operational  environment  and  the 
familiarity  of  the  decisionmaker  with  the  environment 
and  task  at  hand.  It  is  the  interaction  of  these 
factors  thst  influence: 

(1)  behavioral  characteristics  of  the  decision¬ 
maker, 

(2)  interaction  between  decisionauker  and  ana¬ 
lyst, 

(3)  choice  of  computer-based  support  for  deci¬ 
sionmaker  analyst  interaction 

Among  the  behavioral  characteristics  of  the  de¬ 
cisionmaker  that  influence  aiding  consideration 
strongly  are  the  facts  that  the  decisionauker: 

(1)  is  often  impatient  with  time  consuming  and 
stressful  assessment  procedures; 

(2)  wants  to  see  aosw  preliminary  results  proaq>t- 
ly  if  these  are  needed  or  wanted; 

(3)  may  lack  interest  in  interacting  directly 
with  complex  quantitative  procedures  for 
decision  aiding  that  do  not  seem  tailored  to 
the  specific  contingency  task  structure  of 
the  issue  at  hand;  and,  as  a  consequence, 

(4)  requires  a  decision  aiding  approach  that 
adapts  to  the  decisionmaking  style  appro¬ 
priate  for  the  decisionmaker  in  the  given 
contingency  task  structure. 

There  are  a  number  of  considerations  that  in¬ 
fluence  the  most  desirable  interaction  between  the 
decisionmaker  and  the  analyst.  The  interaction  must  be 
such  that  these  result: 

(a)  a  list  of  objectives  and  an  objectives  hier¬ 
archy; 

(b)  a  list  of  alternatives;  and 

(c)  a  list  of  outcomes  for  each  alternative. 

The  extent  of  the  need  for  the  use  of  these  identified 
lists  will  vary  greatly  with  the  "expertise”  of  the 
decisionmaker.  A  major  task  of  the  analyst  in  the 
formulation  and  analysis  portion  of  the  aiding  effort 
is  to  assist  the  decisiomaakrr  in  obtaining  these 
"lists"  in  a  behaviorally  relevant  and  realistic  man¬ 
ner. 

The  analyst  must  also  ensure,  to  the  extent  pos¬ 
sible,  that: 

(1)  the  above  lists  are  reasonably  nvplete; 

(2)  the  lowest  level  objectives  are  additively 
independent ; 

(3)  the  alternatives  are  mutually  exclusive,  and 

(4)  the  outcomes  that  follow  from  each  alterna¬ 
tive  are  mutually  exclusive  and  exhaustive. 

The  nature  of  the  interactive  process  is  such  that 
iterative  changes  can  be  made  in  terms  of  addition  or 
deletion  of  alternatives  and  attributes.  Nevertheless, 
there  are  significant  advantages  in  attempting  to  be 
reasonably  complete  at  the  start  of  the  interpretation 
portion  of  the  process. 


The  decisionmaker  must  provide  the  analyst,  fol¬ 
lowing  behaviorally  realistic  elicitation  procedures. 


information  regarding: 

(1) 

alternative  scores  on 
butes. 

lowest  level  attri 

(2) 

tradeoff  weights. 

v3) 

probabilities,  and 

(4) 

relative  risk  aversion 

coefficients 

or  appropriate  ratios  or  bounds  on  these  quantities 
which  represent  the  precision  that  the  decisionmaker 
believes  appropriate  or  ij  capable  of  providing  for  the 
given  decision  situation. 

There  are  many  computer  based  support  considera¬ 
tions  which  evolve  from  decisionmaker-analyst  interac¬ 
tion  considerations.  A  goal  of  all  decision  support 
system  design  efforts  is  to  obtain  "friendly  software", 
software  that  is  friendly  both  to  the  decisionmaker  and 
the  analyst.  In  particular,  the  analyst  must  be  able 
to  interpret  the  decisionmaker's  structural  and  param¬ 
eter  information  for  input  tu  the  computer.  To  do  this 
may  require: 

(1)  redefining  the  outcome  space,  such  as  redefi¬ 
nition  of  attributes  to  ensure  satisfaction 
of  independence  considerations  and 

(2)  describing  parameter  information  in  terras  of 
inequalities  (or  more  generally  set  member¬ 
ship)  . 

The  analyst  must  be  able  to  interpret  computer 
output  in  a  fashion  that  facilitates  the  decision¬ 
maker's  understanding  and  decisionmaking  abilities. 
The  analyst  must  be  able  to  assist  the  decisionauker  in 
responding  to  the  following  question  which  is  central 
in  our  interactive  knowledge  based  support  system: 

Has  sufficient  preference  and  structural 
information  been  elicited  from  and  provided  to  the 
decisionmaker  for  alternative  selection,  or  is 
more  information  required  for  identification  of  a 
dominance  structure  that  is  relevant  and  appro¬ 
priate  for  quality  decision  support? 

If  the  decisionauker  feels  that  an  alternative  can 
be  selected  for  action  iaqpleauntation  at  any  stage  in 
the  interactive  aiding  effort,  the  analyst  must  be  able 
to  encourage  decisionmaker  judgment  concerning  whether 
or  not  the  issue  formulation,  analysis,  and  interpre¬ 
tation  are  sound.  If  the  issue  formulation,  analysis, 
and/or  interpretation  are  not  perceived  as  sound  by  the 
decisionmaker,  the  analyst  must  be  able  to  encourage 
appropriate  structural  and  parameter  value  audifica- 
tion,  typically  by  means  of  sensitivity  analysis,  in 
order  to  insure  effective,  explicable,  and  valid  plann¬ 
ing  and  decision  support.  If  the  decisionmaker  cannot 
choose  an  alterustive  from  among  those  considered,  the 
analyst  must  be  capable  of  eliciting  further  structural 
and/or  parameter  information  to  enhance  appropriate 
selection  of  alternative  courses  of  action. 

One  very  Important  feature  of  a  knowledge  based 
system  for  planning  and  decision  support  is  encourage¬ 
ment  to  the  decisionmaker  for  generating  new  options, 
outcomes,  and  attributes  at  essentially  any  point  in 
the  aiding  effort;  and  ability  to  properly  evaluate 
theme  new  options.  The  analyst  must  be  able  to  cope 
with  this  additional  information  uader  the  assumption 
that: 
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(1)  ib*  new  intoi'Mtiuu  .  U  consistent  with  pre¬ 
viously  obtained  information;  or 

(2)  the  new  information  la  not  consistent  with 
previously  obtained  information  due  to 

(a)  structural  inconsistencies,  or 

(b)  parameter  inconaistencies. 

Thus,  the  capacity  to  resolve  potential  inconsis¬ 
tencies  through  interaction  with  the  decisionmaker  must 
exist  within  the  planning  and  decision  support  process. 
The  indirect,  or  inverse  decision  aiding,  feature 
should  be  of  particular  value  to  this  end.  Ir  a  "poli¬ 
cy  capture"  like  fashion,  this  Indirect  feafre  will 
allow  identification  of  bounds  on  attribute  weights  in 
terms  of  wholistic  preferences  among  some,  or  all, 
alternatives.  In  the  direct  aiding  feature,  values, 
weights,  and  probabilities  are  identified  and  priori¬ 
tization  of  alternatives  result  from  this.  Combined 
use  of  the  direct  aiding  feature  with  indirect  aiding 
should  result  in  much  learning  feedback  concerning 
relations  among  the  various  modes  of  judgment. 

ARIADNE,  as  we  have  noted,  does  not  contain  soft¬ 
ware  to  assist  in  the  formulation  and  analysis  portion 
of  the  planning  and  decision  support  effort.  It  is  in 
these  two  steps  that  alternative  choices,  attributes 
and  decision  isqracts  or  outcomes  are  elicited  or  iden¬ 
tified.  Our  effort  is  much  sore  concerned  with  the 
interpretation  part  of  a  decisionmaking  effort;  that  is 
to  say  how  inforswtion  is  processed  concerning  formula¬ 
tion  and  analysis  based  quantities  such  w-  probabili¬ 
ties,  values,  weights,  ratios,  and  bounds  upon  these. 
We  are  concerned  also  with  the  way  in  which  this 
information  is  aggregated,  by  any  of  a  variety  of 
formal  knowledge,  rule  based,  or  skill  baued  modes  of 
cognition  that  result  in  judgment  and  choice.  We 
recognize  the  difficulties  in  separating  the  tasks 
of  formulation  and  analysis  from  those  of  Interpre¬ 
tation.  There  are  difficulties  at  the  systems  manage¬ 
ment  level  since  the  way  in  which  people  cognize  a 
problem,  as  port  of  the  contingency  task  structure  of  a 
particular  situation,  determines  the  way  in  which  they 
will  go  about  resolving  it.  Thus  the  performance 
objectives.  Information  processing  style,  a,d  decision 
style  that  are  most  appropriate  and  that  are  likely  to 
be  used  for  a  given  task,  are  very  much  dependent  upon 
the  task  itself.  When  a  particular  concrete  opera¬ 
tional  or  skill  based  strategy  has  yielded  previous 
satisfactory, results,  many  people  will  tend  to  use  that 
strategy  unqeestiontngly  and  uncritically  In  new  situa¬ 
tions  perceived  to  be  similar.  This  can  result  in  very 
unsatisfactory  judgments  and  choices  in  decision  situa¬ 
tions  that  h^ve  changed  and  that  are  not  recognized  as 
different  from  familiar  past  situations  This  may 
result  in  prefsature  cessation  of  aearen  and  evaluation 
of  alternatives  prior  to  identification  of  quality 
strategies,  even  for  familiar  situation* .  file  efforts 
ran  be  devastating  in  unfamiliar  envlronoen' s  that  are 
not  so  recognized  111). 

The  strategies  which  a  decisionmaker  will  desire 
to  use  for  interactive  interpretation  will  be  strongly 
dependent  upon  the  way  in  which  the  task  requirements 
sre  initially  cognized.  This  will  influence  the  objec¬ 
tives,  attributes,  snd  alternatives  generated  in  the 
formulation  step  snd  the  value  scores  or  impacts  asso¬ 
ciated  with  them  in  the  analysis  step.  The  input 
information  to  the  interpretation  step  is  just  this 
information.  Adequacy  of  the  interactive  interpreta¬ 
tion  step  will  clearly  be  dependent  upou  the  "quality" 
of  the  information  input  to  it. 

Many  recent  studies  (12)  Uave  Indicated  that 
people  often  construct  selectively  perceived  simple 


deterministic  representations  of  decision  situations 
that  make  infonsatlon  processing  easy  and  which  do  not 
reflect  the  complexities  and  uncertainties  that  are 
associated  with  the  actual  situation.  A  goal  of  a 
decision  support  system  is  to  encourage  wide  scope 
perceptions  and  associated  information  processing.  The 
process  used  to  assess  probabilities,  utilities,  aud 
weights  will  doubtlessly  affect  the  quantities  that  are 
e'icited.  It  is  possible,  for  example,  that  a  poor 
elicitation  procedure  may ,  unknowingly  or  knowingly, 
create  rather  than  measure  values  {13}.  An  advantage 
to  formal  support  for  planning  and  decisionmaking 
processes  is  that  it  is  possible  to  conduct  a  search 
for  lncot siatent  judgment  and  perhaps  even  detect 
flawed  information  processing  heuristics  if  process 
tracing  is  used.  When  inconsistencies  are  discovered, 
it  then  becomes  possible,  at  least  in  principle,  to 
examine  the  judgamnt  process  to  determine  which  judg¬ 
ments  imply  flawed  information  prorcsslng,  end/or 
incoherent  or  labile  values,  and/or  deficient  decision 
rules.  A  major  blt'.mate  goal,  outside  the  scope  of  our 
present  study,  is  to  suggest  debiasing  and  other  cor¬ 
rective  procedures  to  enhance  the  quality  of  human 
information  processing  and  decision  rule  selection. 

This  mixed  scanning  based  planning  and  decision 
support  system  is  based  upon  rational  search  for  a 
dominance  structure  which  will  enable  exposure  of  some 
of  the  processes  upon  which  Judgment  and  choice  is 
based.  In  particular,  it  enables  determination  of  the 
precise  point  in  a  dominance  structure  search  process 
when  a  decisionmaker  la  able  to  select  a  single  non- 
dominated  alternative.  We  should  be  able  to  do  this 
without  resorting  to  a  complete  elicitation  of  precise 
paraawter  information  and  prioritization  of  all  alter¬ 
natives  The  activity  of  complete  precise  determina¬ 
tion  of  all  parameter  information  is  often  stressful 
and  time  consuming,  may  require  perspective*  outside  of 
the  experiential  familiarity  of  the  decisionmaker,  and 
allow*  few  results  until  conclusion  of  the  aiding 
effort. 

The  overall  process  described  bete  appears  well 
suited  to  sccnmmrj dating  the  fact  that  neither  indi¬ 
vidual*  nor  groups  possess  static  decision  styles 
capable  of  being  stereotyped  and  captured  by  a  rigid, 
inflexible  support  process.  It  »*  specifically  recog¬ 
nized  that  an  Interactive  process  is  needed  that  is 
capable  of  adaptation  to  a  variety  of  decision  styles 
that  are  contingency  task  structure  dependent.  System 
design  should  reflect  the  realization  that  is  generally 
not  possible  to  define  a  problem  or  issue  fully  until 
one  knows  potential  solutions  to  the  issue.  A  major 
cause  of  this  is  the  fact  that  information  to  fully 
define  the  issue  generally  becomes  available  only  a* 
one  evaluates  potential  solutions.  Planning  and  de¬ 
cisionmaking  will  therefore  necessarily  be  iterative. 

A,  Behavioral  Relevance  Issues 

Our  decision  support  system  design  paradigm  is 
based  upon  a  process  model  of  decisionmaking  in  which  a 
person  perceive*  an  issue  which  may  require  a  change  in 
the  existing  course  of  action.  On  the  basis  of  a 
framing  of  the  decision  situation,  one  or  more  al¬ 
ternative  courses  of  action,  in  addition  to  the  present 
option  which  may  be  continued,  are  identifier,.  A 
preliminary  screening  of  the  alternatives,  using  con¬ 
junctive  and  disjunctive  scanning,  may  eliminate  all 
but  one  alternative  course  of  action.  Unconflicted 
adherence  to  the  present  course  of  action  or  uncon¬ 
flicted  change  to  a  new  option  may  well  be  the  met* 
strategy  for  judgment  and  choice  that  is  adopted  if  the 
dti islonauker  perceives  that  the  decision  situation  is 
s  fnailiar  one  and  that  the  stakes  are  not  so  high  that 
a  more  thorough  search  and  deliberation  is  needed  111). 
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A  rl  ei  n«i  t  <*  1  y ,  it  the  Jcumod  •environment  is  an 
unfamiliar  one,  or  the  stakes  associated  with  judgment 
and  choice  aie  high,  a  more  vigilant  form  of  informa¬ 
tive  aujinsil  ,on,  analysis,  and  1  nterpret  at  ien  are 
called  for.  Th.s  desire  tor  more  vigilant  information 
processing  leads  to  a  search  for  a  dominance  pattern 
among  alternatives,  th^  search  lor  new  alternatives 
that  are  not  dominated  bv  presently  idetiiiiiol  alter¬ 
natives,  and  the  elimination  from  fur  her  consideration 
of  a  1  ternat  i  ve»  that  ar  dt-nmii'ed.  If  e.o  single  non- 
.  dominated  alt«  rnalivr  :s  found,  adjustments  to  the 
dominance  structure  of  alt<:natives  are  made  through 
various  forms  of  cognitive  activity  such  as.  attrioutp 
aggregation,  additional  information  acquisition  and 
analysis,  and  i  dent i f i cat  ion  of  additional  attributes 
and/or  a  1 ternat i ves .  This  is  continued  until  the 
st r."*  ire  of  needs,  objectives,  attributes,  ana  ?'ter- 
na„iv»  action  options,  and  their  impacts  are  such  that 
identification  of  a  single  non-dominated  alternative 
results.  This  “single  alternative'*  may  well  represent 
a  combination  of  subalternatives.  If  there  is  insuf¬ 
ficient  time  and  experience  to  accomplish  these  cog¬ 
nitive  activities,  hyper*'igi  1  ante  generally  results 
The  decisionmaker  is  in  a  situation  where  the  present 
course  of  action  is  diagnosed  as  unfortunate  and  there 
is  a  shortage  of  time  and  experience  that  might  enable 
i  i  at i f icat i on  and  evaluation  of  an  appropriate  one. 

Given  sufficient  time  and  experience,  vigilant 
information  processing  often  results  from  the  afore¬ 
mentioned  tasks.  Figure  1  presents  some  salient  fea¬ 
tures  of  this  dominance  process  model  for  search, 
discovery,  judgment,  and  ci.oice. 

The  mode  of  judgment  and  choice  that  is  "proper" 
depends  upon  the  decisionmaker's  situation  diagnosis  of 
the  contingency  task  structure.  Here,  "proper"  deci¬ 
sion  behavior  is  based  upon  the  assumption  that  the 
environment,  the  task,  the  exponent*  ’  familiarity 
with  the  task,  and  the  envi  ro«  ..if  »U  tha.  Cv  ■  ■■  ♦  i  *.  jL*.s  the 
contingency  task  stmeture  are  diagnose:.!  c  r?  tly.  If 
this  is  not  the  case,  then  the  st  rate,,it  s  le.ding  to 
u..wo«  f  1  icted  change,  adherence,  or  vigiU.it  information 
processing  may  be  significantly  flawed.  The  role  of 
t..e  contingency  task  structure  in  situation  diagnosis 
ard  in  influencing,  at  a  me*ta  level  or  systems  manage¬ 
ment  level,  the  proress  of  judgment  and  choice  is, 
th  re fore,  a  very  important  one. 

There  have  been  many  realistic  paradigms  of  the 
process  of  judgment  and  choice.  We  believe  that  th^ 
dominance  process  rodel  described  here  is  not  incon¬ 
sistent  with  the  primary  features  and  intensions  of 
these  descriptive  models.  Our  puipose,  however,  is  to 
develop  a  conceptual  design  for  a  prescriptive  approach 
to  judgment  and  .hoice  that  will  aid  in  the  search  for 
better  decisions.  We  rrccgnizr  that  a  truly  national 
approach  to  prescriptive  decisionmaking  must  be  cog¬ 
nizant  of  the  process  of  decisionmaking  as  it  evolves 
in  a  descriptive  fashion,  that  is  to  say  process  ra¬ 
tionality,  or  it  will  not  be  possible  to  evolve  sub¬ 
stantively  rational  support  systems. 

It  is  important  that  an  appropriate  decision 
support  system  he  capable  of  assisting  the  decision¬ 
maker  through  encouragement  of  full  i r format  ion  acqui¬ 
sition.  including  that  which  may  discontirm  strongly 
held  beliefs,  and  in  the  analysis  and  interpretation  oi 
this  information  such  as  to  avoid  a  variety  of  cogni¬ 
tive  blast’s  and  poor  1  nf o i ma t  i on  proressi  »g  heuristics 
that  may  l»ad  to  flawed  judgment  and  i.ioice  (2,12). 

A  realistic  derision  support  process  is  neces¬ 
sarily  iterative.  Several  desiderata  follow  from  this: 


1  .  We  should  allow  for  top-down  or  bottom-up 
structuring  of  the  attributes  of  outcomes,  or 
imnacts  oi  decisions.  The  "tree"  or  "hier¬ 
archy”  of  attributes  should  be  structured  to 
the  depth  b**l  ieved  appropriate  by  the  dr- 
c  i  s  i  otuna  ke  r . 

2.  Rather  than  force  a  decision  situation  struc¬ 
tural  model  in  the  form  of  a  tree,  we  should 
encourage  the  dec  is ionmaker  to  identify  a 
cognitive  map  of  goals,  objectives,  needs, 
attributes,  alternatives,  and  impacts  that  is 
reflective  of  the  way  in  which  the  decision- 
maker  perceives  diagnostic  and  causal  infer¬ 
ences  to  occur.  At  some  later  time  this 
cognitive  map  may  be  used  to  structure  a 
multinode  decision  tree  which  is  represen¬ 
tative  of  substantive  rationality,  but  not  at 
all  necessarily  representative  of  process 
rations  1  it  y . 

3.  We  should  entourage  identification  of  alter¬ 
native  courses  of  action,  additional  attri¬ 
butes  of  decision  outcomes,  and  revision*  to 
previously  obtained  elicitations,  at  any 
point  in  the  decision  support  process  as 
awareness  of  the  decision  situation  and  its 
structure  grows  through  use  of  the  support 
system . 

4.  We  should  not  force  a  person  to  quantify 
parameters  to  the  extent  that  this  becomes 
overly  stressful,  or  behaviorally  and  phy¬ 
sically  irrelevant  in  view  of  the  inherent 
uncertainties  or  imprecision  that  is  asso¬ 
ciated  with  the  know1 edge  of  parameters 
characterizing  the  decision  situation  struc¬ 
tural  mode',  or  their  assessment. 

These  have  two  priitary  implications  with  respect 
to  our  interpretation  efforts.  We  allow  for  revision 
in  the  elicited  structure  of  the  decision  situation  and 
for  the  identification  of  new  options  as  awareness  of 
tin'  decision  situation  grows.  Also,  we  do  not  require 
the  decisionmaker  to  quantify  parameters  beyond  the 
level  felt  appropriate  for  the  situation  at  hand.  If 
the  decisionmaker  feels  comfortable  in  exercising 
precision  with  respect  to  factual  outcomes,  this  is 
perfectly  acceptable  and  desirable.  »ut  parameter 
imprecision  should  he  allowed  if  we  are  to  have  a 
realistic  support  process. 

ARIADNE  allows  parameter  imprecision  in  order  to 
satisfy  this  quant i f i cat  ion  relevancy  requirement,  as 
do  approaches  based  on  fuzzy  set  theory  (16].  We 

encourage  the  decisionmaker  to  specify  precise  values 
or  numerical  ranges  for  facts  and  values.  Thus  we 

allow,  for  example,  expressions  for  altertative  (A) 

scoies  on  attributes  (i)  in  the  form  0.2  <  v.(A)  <  C.5 

i  - 

weights  associated  with  attribute  i  in  the  form  0.2  «■ 
w^  <  0.4,  and  probabilities  of  event  fi)  resulting  from 

alternative  A  in  the  form  0.3  <  P.(A)  <  0.45.  We  allov’ 

-i  - 

ordinal  representations  in  the  linear  forms  v.;'A)  * 

v  (B)  <  v .  (C) ,  2w .  <  w  <  v.,  P .  (A)  «:  P.{  A>  *  JPl(A), 
i  -  i  i  j  kj  *  i  -  k 

or  in  similar  forms.  Quant  i  f  icat  1  or.  ol  imprecision 

in  the  form  of  numerical  bounds  on  paraivetc  i  &  always, 

leads  to  what  we  call  "behaviora  l  ly  pons  i s  .wilt  infor¬ 
mation  sets  (HC1S)."  Sometimes  totally  icd*na!  infor¬ 
mation  may  nerd  further  quant i f i ca t ion  it*  order  to  raX- 
the  precision  and  rigidity  of  the  mathrm»v  cor¬ 
respond  to  the  intensions  of  the  de^is  i  onmakfV  in 
making  a  purely  ordinal  specification.  Tbij:  is  g~\  ■ 
orally  not  needed  to  obtain  solutions  but,  rather,  to 


obtain  parametric  models  that  are  faithful  to  the 
understandings  of  the  decisionmaker.  For  example  that 
ordinal  alternative  score  inequalities  0  <  v^CA)  < 

v^(B)  <  v^(C)  <  1  are  satisfied  by  the  relations  0  < 
vt(A)  <  l-2t,  W  <  Vj IB)  <  1-t,  2W  <  v  (C)  <  1  for  small 

positive  t  and  W  vhich  in  the  limit  become  zero.  It 
will  generally  not  be  the  case  that  the  decisionsMkrt 
would  express  this  much  imprecision,  and  would  wish  to 
see  it  more  fully  quantified  to  be  reflective  of  (sub¬ 
jective)  beliefs.  It  is,  therefore,  important  that  a 
simple  and  informative  display  of  value  scores, 
weights,  snd  probabilities  be  provided  to  the  decision-, 
maker.  This  will  enhance  interactive  use  of  the  sup¬ 
port  system  and  will  enable  learning  of  the  impact  of 
these  parameters,  and  associated  imprecision,  upon 
decisions . 

5.  Conclusions 

In  this  paper,  we  have  examined  some  underlying 
considerations  that  have  influenced  the  development  of 
a  decision  support  system  that  specifically  recognizes 
that  iaqareclse  and  incomplete  knowledge  is  important  to 
judgment  and  choice  and  which  allows  for  its  incor¬ 
poration  in  the  knowledge  base  of  a  decision  support 
system.  The  system  allows  for  judgment  and  choice  at  a 
skill  based  wholistic  level  as  well  as  at  the  formal 
reasoning  based  level  at  which  most  decision  analysis 
based  paridms  operate.  For  detailed  discussions  of  the 
algorithmic  content  of  ARIADNE,  the  reader  is  referred 
to  (6,7,14-26). 
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Figure  1.  Descriptive  Dominance  Structural  Model  of 
Decision  Process. 
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^  abstract 

'^Tit  this  paper.  we  presentff,the  objective*, 
operational  details,  results,  and  concluaion*  of  an 
evaluation  of  a  deciaion  aiding  procedure  ARIADNE.  The 
reaulta  of  the  evaluation  indicate  that  ARIADNE,  in 
comparison  to  a  well-known  deciaion  aiding  procedure 
called  SMART,  (11  haa  a  tore  flexible  aodel  of 
parameter  value  deacription  that  tend*  to  reduce 
aaaeaaaent  atreaa  and  aiakea  ARIADNE  i»ore  uaeful  in 
aituationa  where  information  preciaion  ia  poor,  (21 
allowa  earlier  preaentation  of  initial  alternative 
ranking  lnforawtion  and  (31  allowa  the  decisionmaker  to 
adjust  the  aiix  of  alternative  ranking  apeclflcitv  and 
parameter  value  preciaion. 

l  — 

1.  INTRODUCTION 


4*  ARIADNE  requires  lea*  time  for  uae  than  doea 
SMART. 

3.  Decisionmakers  do  not  feel  that  it  la  neceaaarv 
for  an  aid  to  produce  a  aingle  beat  alternative  to 
aaaiat  the  decisionmaker  in  aelecting  the  most 
preferred  alternative. 

6.  ARIADNE  la  More  uaeful  than  SMART  in  aituationa 
where  lnforatatlon  preciaion  la  poor. 

"•  Problems  typically  encountered  in  the  subjects' 
operational  environment  would  be  more 
appropriately  examined  aided  bv  ARIADNE  than  aided 
by  SMART. 

8.  ARIADNE  la  no  more  difficult  to  underatand  and  uae 
than  SMART. 


In  this  paper,  we  preaent  the  objective*. 

operational  details,  reaulta,  and  concluaiona  of  an  w*  now  diaeuaa  the  proceduree  for  teatlng  these 
evaluation  of  two  deciaion  aiding  approachaa,  ARIADNE  hypotheses, 
and  SMART.  An  lndepth  description  of  SMaRT  (Simple, 

MultlAttrlbute  Rating  Technique)  can  be  found  in  ***•  OPERATIONAL  ASPECTS  OF  EVALUATION 
Edwards  (1977);  Detailed  discussions  and  algorithmic 

descriptions  of  ARIADNeT  (Alternat  lve  Ranking  Eight  (8)  civilian*  employed  by  the  United  State* 

Interactive  Aid  baaed  on  DomlNance  structure  Foreign  Science  and  Technology  Center  (FSTC) 

information  Elicitation) We  presented  in  the  companion  participated  as  subjects  in  the  evaluation.  Each  of 

paper  (Sage  and  White.  1*83).  In  Section  II.  we  list  *h«  aubj.cts  had  had  extensive  involvement  in  technical 

the  hypotheses  that  were  tested  during  the  evaluation.  project  evaluation  in  a  military  environment  and  thus 

The  operational  aspects  of  the  evaluation  are  detailed  h,<*  sufficient  experience  to  appreciate  the 

in  Section  III.  We  present  the  result*  of  the  difficultie*  and  operational  issues  involving  proposal 

evaluation,  and  the  concluaiona  that  we  obtain  from  evaluation.  Proposal  evaluation  waa  a  subject 

these  relative  to  the  identified  hypotheses,  in  Section  addreaaed  in  the  more  specialised  of  the  two 

IV.  decisionmaking  scenarios  examined  by  the  subjects 

during  the  evaluation. 

II.  IDENTIFIED  HYPOTHESES : 

The  two  scenarios  developed  for  the  evaluation 
The  Intent  of  the  evaluation  waa  to  teat  the  were  proposal  evaluation  and  sports  car  selection.  Each 
following  hypotheses:  o{  these  scenarios  can  be  obtelned  from  the  authors. 

It  waa  assumed  thet  each  scenario  involved 

1.  Use  of  the  more  general  model  of  parameter  value  decisionmaking  under  certainty.  The  first  scenario  waa 

description  in  ARIADNE  tends  to  reduce  stress  designed  to  represent  a  realistic  proposal  evaluation 

associated  with  the  assessment  of  alternative  problem  that  might  occur  in  a  DOO  funding  agency, 

values  and  attrlbuta  weights.  Although  attributes  and  soma  ordinal  relations  among 

attribute  weights  were  specified  in  the  RFP  to  which 

2.  Uae  of  the  more  general  model  of  parameter  value  the  proposals  were  to  respond,  inforawtion  presented  in 

description  in  ARIADNE  tend*  to  increase  the  five  (M  submitted  proposals  from  which  to  deduce 

confidence  in  the  final  alternative  selected.  utility  scores  and  henca  tradeoff  weights  was  often 

vegue  and/or  not  available.  Aleo,  there  waa  room  for 

3.  The  ability  to  provide  additional  parameter  judgement  in  strengthening  the  ordinal  relatione  among 

information,  in  a  form  and  sequence  selected  by  the  attribute  weights  that  were  provided  in  the  RFP 

the  decisionmaker,  and  to  observe  its  impact  on  aummarited  in  the  scenario.  The  sports  car  selection 

alternative  ranking  in  an  iterative  feshl'on  ia  a  scenario  was  designed  to  represent  a  uuch  more 

desirable  feature  of  ARIADNE.  preciaely  defined  alternative  selection  problem. 


Standard  procedures  wars  uaed  in  order  to 
investigate  and  compensate  for  effect*  due  to 
,  facilitation  atyle  and  order  with  respect  to  decision 

Thia  research  haa  been  supported  by  the  Office  of  »ld  *nd  scenario. 

Naval  Research  under  contract  number  N00014-80-C-OS42. 
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Th«  chronology  for  the  evaluation  waa  at  follow*: 

1.  General  briefing.  A  briefing  waa  given  to  the 
volunteer  subjects  regarding  the  purpose  of  the 
evaluation  and  the  characterist les  of  the  two  aids. 
Individual  evaluation  sessions  were  scheduled,  and  both 
scenarios  were  given  to  the  subjects  to  read  prior  to 
the  individual  sessions. 

2.  Individual  sessions.  Individual  evaluation 

sessions  were  conducted.  Each  session  for  each 

individual  subject  involved  a  subject,  facilitator,  and 
computer  cersinal  operator.  If  ARIADNE  was  used, 
assessed  information  regarding  lowest  level  attribute 
utility  scores  and  tradeoff  weights  was  allowed  to  be 
lsipreclse  and  was  translated  into  linear  inequalities 
by  the  facilitator  and/or  cosiputer  terminal  operator. 
Initially,  only  utility  score  Information  for  the 
alternatives  on  the  identified  attributes  was  assessed. 
Once  this  assessment  was  completed,  a  domination 
digraph  on  the  alternatives  was  computed  and  displayed 
to  the  subject.  The  subject  could  also  view  a  score 
sheet  of  values  from  which  this  digraph  was  obtained. 
If  this  digraph  provided  sufficient  information  for 
alternative  selection,  then  this  portion  of  the  session 
was  halted.  If  not,  then  further  utility  score  and/or 
tradeoff  weight  information  was  requested  and  the 
resulting  domination  digraph  displayed.  This 
information  could  concern  attribute  scores  and  weights 
not  previously  obtained  or  more  precise  estimates  of 
previously  elicited  scores  and  weights.  This  iterative 
procedure  continued  until  the  subject  halted  the 
process. 

If  SMART  was  used,  all  parameters  were  precisely 
assessed.  Then  the  total  linear  order  on  the 
alternatives  was  displayed.  If  the  subject  wished,  a 
post  optimal  sensitivity  analysis  was  performed  on 
whatever  single  parameter  values  were  of  concern. 
Detailed  descriptions  of  facilitation  protocols  can  be 
obtained  from  the  authors. 

During  the  examinations  of  each  scenario,  the 
computer  terminal  operator  completed  an  Analyst 
Information  Sheet,  detailing  various  times  and  types  of 
requests.  After  completing  both  scenarios,  the 
subjects  were  asked  to  complete  a  short  questionnaire 
and  return  it.  Copies  of  both  the  Analyst  Information 
Sheat  and  tne  Questionnaire  can  be  obtained  from  the 
authors. 

IV.  RESULTS  AND  CONCLUSIONS: 

We  now  examine  each  of  the  hypotheses  in  the 
context  of  data  collected  during  the  evaluation. 

Hypothesis  0  1 :  Use  of  the  more  general  model  of 
parameter  value  description  in  ARIADNE  tends  to  reduce 
stress  associated  with  the  assessment  of  alternative 
values  and  attribute  weights.  Relevant  Questionnaire 
questions  (0)  and  responses  (R): 

01*  Being  allowed  to  express  parameter  values 
imprecisely  using  ARIADNE  produce i  more  stress 
than  being  required  to  state  ail  parameter  values 
precisely  using  SMaRT. 

R*  (presenting  individual  subject  scores  below  the 
line;  +5(-5)  indicates  strong  (dis) agreement) 

-5  0  +5 

1  6  8  7  2  **  5 

3 

indicating  a  tendency  to  agree  with  the  hypothesis. 
There  were  two  relevant  comments:  SI  (Subject  1): 
"■.iked  th?  flexibility.** 


S3:  "Perhaps  more  stress  initially  (with  ARIADNE) 
beesuse  I  didn't  understand  the  process.  But 
after  using  It,  I  would  be  inclined  to  say  far 
less  stress  (with  ARIADNE)  for  complex 
applications." 

06:  What  was  the  most  comfortable  way  of  expressing 
parameter  value  information  fer  you? 

R:  Indicated  that  two  subjects  (S2,  S3)  preferred 

exact  values,  two  subjects  (SI,  SB)  preferred 
interval  estimates,  snd  four  (S3,  S4»  S6,  S7) 
preferred  ranking  statements  for  expressing 
parameter  value  information.  There  was  one 
relevant  comment  from  S3:  * f or  the  exercise 

today,  [exact  value)  would  be  the  answer;  however, 
for  actual  application  In  complex  areas,  [ranking 
statements)  is  my  answer."  These  responses 
esuse  us  to  conjecture  that  if  the  subjects  had 
been  more  experienced  in  expressing  parameter 
values  imprecisely  (several  had  experience  in 
expressing  value  scores  and  tradeoff  weights 
precisely)  then  there  would  have  been  stronger 
support  for  this  hypothesis. 

Hypothesis  *  2:  Use  of  the  more  general  model  of 
parameter  value  description  in  ARIADNE  tends  to 
increase  confidence  in  the  final  alternative  selected. 
Relevant  Questionnaire  questions  and  responses: 

02:  I  felt  more  confidence  in  the  Iflnal  alternatives 

chosen  when  sided  by  ARIADNE  khan  in  the  final 
alternative  produced  when  aided  by  SMART. 

R:  i 

o  j  . 

2  7  4  l  3 

5  B 

6  I 

Indicating  slight  support  for  the  Itypothesis.  There 
were  no  especially  relevant  written  comments. 

OB:  I  disagreed  .  with  the  action  alternative 

recommendation-  obtained  using  ARIADNE. 

R:  (S1,...,S7)  "no";  SB  "don’t  remember." 

014:  I  disagreed  with  the  action  alternative 
recommendation  obtained  using  SMART. 

! 

R:  S2  "yes;"  (SI,  S3,  S  7 )  i  "no;"  SB  "don't 

remember."  Thus,  responses  to  questions  the 
latter  2  Indicate  no  recognizable  difference  in 
the  perceived  quality  of  the  decisions  made  us  in? 
ARIADNE  versus  those  made  using  SMART.  There  were 
no  relevant  comments  associated  with  either 
question. 

0*7:  Which  approach  would  you  prefer  to  use  to  make 
recommendations  to  others  concerning  evaluation 
and  prioritization  and  why? 

R:  (SI,  S3,  S7 ,  SB)  "ARIADNE;"  (S2,  S5,  S6)  "SMAPT;" 

S4  "neither."  Three  relevant  comments  were  made- 
S *♦:  "Depends  on  Situation."  S6r  "Since  SMART 

gives  percentages,  one  can  see  if  two  proposals 
are  close  and  then  adjust  the  rankings  by 
consideration  of  factors  that  were  not  originally 
considered."  S7:  "By  only  indicating  preference 
as  opposed  to  exact  values  is  very  helpful;  weight 
max-min  display  nice  feature." 

We  feel  S4*s  comments  tends  to  explain  his  response  to 
this  question.  The  comments  of  S6  and  S7  Indicate  that 
the  max-min  display  (Indicating  the  maximum  and  minimum 
values  of  expected  utility  for  each  alternative) 
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incorporated  into  ARIADNE  was  likevl  fcy  S7  and  m.."  not 
have  been  requested  by  S6.  We  conjecture  that  lad  S6 
seen  this  display,  his  response  would  have  been 
different. 

012:  Use  of  ARIADNE  encouraged  me  to  carefully  weigh 
the  positive  and  negative  consequences  of  each 
alternative. 

016:  Use  of  SMART  encouraged  me  to  carefully  weigh  the 
positive  and  negative  consequences  of  each 
alternative. 

Responses  to  012  and  Q16  were,  respectively: 

-5 _ 0 _ 3 

6  3  2  3  4  1 

7  8 


_ 0 _ 5 

3  17  2  5 

•4 

6 

6 

indicating  that  ARIADNE  and  SMART  provided  ar 
approximately  equal  level  of  encouragement  to  the 
subject  for  him  to  carefully  weigh  the  positive  and 
negative  consequences  of  each  alternative. 

In  summary,  questionnaire  responses  indicate  that 
the  level  of  confidence  in  the  output  of  ARIADNE  and 
SMART  appear  to  be  quite  similar.  Also  indicated  was 
that  both  aids  equally  encourage  the  careful  weighing 
of  the  possible  consequences  of  the  alternatives. 

Hypothesis  *  3.  The  ability  to  provide  additional 
parameter  information,  in  a  form  and  sequence  selected 
by  t he  decisionmaker,  and  to  observe  its  impact  on 
alternative  ranking  in  an  iterative  fashion  is  a 
desirable  feature  of  ARIADNE.  Relevant  Questionnaire 
questions  and  responses: 

Q3 :  Being  able  to  provide  additional  parameter 

information  and  then  to  observe  its  Impact  on 
alternative  ranking  in  an  iterative  fashion,  was  a 
desirable  feature  of  ARIADNE. 

R: 

-3 _ _ 5 

3  6  2  1  8 

7  3 


indicating  strong  support  for  the  hypothesis. 

As  an  indication  of  how  often  the  feature  of 
ARIADNE  being  evaluated  in  this  hypothesis  test  was 
exercised,  the  computer  terminal  operator  recorded  on 
the  analyst  information  sheet  the  number  of  iterations 
required  in  constructing  the  final  domination  digraph. 
The  number  of  iterations  for  each  subject  was 
respectivelv:  3,  3,  0,  *4,  0,  2,  3,  and  3. 

Thus  the  subjects  developed  a  reasonable  level  of 
experience  with  this  feature  of  ARIADNE,  placing  a  high 
level  of  confidence  in  the  responses  to  03. 

04:  Knowing  that  a  single  iteration  always  produces  a 

best  alternative  is  a  desirable  feature  of  SMART. 

R: 

-3  0 _ 3 

6  7  1  2  4  3 

3  8 


indicating  that  the  subject!  found  moderately 
attractive  the  f.nct  that  SMART  always  produces  a  best 
alternative  in  one  iteration. 

The  responses  to  03  and  Q4  Indicate  that  the 
dec isior maker  should  be  encouraged  to  be  as  precise  as 
possible  in  order  to  reduce  the  number  of  iterations  ot 
ARIADNE  for  final  decision  selection. 

Hypothesis  ft  4:  ARIADNE  requir  es  less  time  for 
use  than  does  SMART.  Relevant  quest. c ns  and  responses: 

03:  Use  of  SMART  lead  me  to  a  decision  more  quickly 

than  use  of  ARIADNE. 

R : 

•1 _ 2 _ l 

6  8  7  1  2  * 

3  5 

indicating  that  the  perceptions  of  the  subjects  tend 
not  to  support  the  hypothesis. 

Timing  data  recorded  on  the  analyst  information 
sheets  support  the  perceptions  of  the  subjects  with 
regard  to  the  average  total  length  of  time  per  session. 
Let  VET  *  value  elicitation  time,  WET  •  weight 
elicitation  time,  TST  ■  total  session  time.  Then,  the 
timing  data  (in  .ainutes)  are  as  follows: 


1 

2 

3 

4 

APS 

VET 

*1.75 

18.75 

19.25 

21.25 

WET 

9.5 

10.5 

6.0 

14.0 

TST 

57.0 

41.25 

47.25 

51.0 

ccv 

VET 

29.5 

24.25 

28.75 

45.0 

WET 

11.75 

9.75 

10.0 

11.5 

TST 

57.5 

40.75 

47.0 

51.25 

TOTAL 

VET 

55.625 

21.5 

24.0 

23.125 

WET 

10.625 

10.125 

8.00 

12.75 

TST 

57.25 

41.0 

47.125 

51.125 

where  1  •  ARIADNE,  2  •  SMART,  4  •  proposal  evaluation, 
3  •  sports  car  selection.  Thus,  use  of  ARIADNE 
required  on  average  402  more  total  time  than  did  SMART 
and  ■  the  proposal  evaluation  scenario  required  on 
average  8.52  more  time  to  evaluate  than  did  the  sports 
car  selection  scenario.  Both  facilitators  on  average 
required  49.125  minutes  per  scenario. 

Also  recorded  on  the  analyst  information  sheet  was 
the  length  of  time  between  the  beginning  of  the  session 
and  the  beginning  of  the  weight  elicitation  process. 
For  ARIADNE,  this  length  of  time  represents  an  upper 
bound  on  the  length  of  time  to  the  presentation  of  the 
first  digraph.  These  times  for  the  8  subjects  were: 
25,  41,  20,  30,  30,  !  5 ,  25,  33  for  an  average  of  27.623 
seconds.  We  remark  tnat  SMART  required  approximately 
*82  more  time  to  provide  initial  alternative  ranking 
information  to  ths  decisionmaker  than  did  ARIADNE.  If 
we  assume  that  total  session  times  for  SMART  and  the 
lengths  of  time  between  the  beginning  of  the  session 
and  the  beginning  of  the  weight  elicitation  process  for 
ARIADNE  are  realizations  of  normally  distributed  random 
variables  with  unknown  means  and  unknown  variances, 
then  a  standard  statistical  test  Indicates  that  these 
realizations  come  from  two  different  random  variables 
with  a  confidence  level  of  greater  than  0 .95. 
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In  iciatr;,  All.  ONE  required  leee  tlae  to  provide 
Inltlet  alternative  eelectlon  feedback  than  did  SMART 
but  aore  tlae  to  coaplete  the  entire  session. 
Therefore,  If  the  alternative  eelectlon  situation  la 
such  that  soae  alternative  ranking  lnforaatlon  Is 
better  than  none,  ARIADNE  Mould  tend  to  be  the 
preferred  aiding  procedure. 

Hypothesis  *  >■  Declslonaakers  do  not  feel  that  It 
la  necessary  for  an  aid  to  produce  a  single  best 
alternative  to  assist  the  decisionaaker  in  selecting 
the  aost  preferred  alternative.  Relevant  Questionnaire 
question  and  response; 

Q7;  It  Mould  be  necessary  for  an  aid  to  produce  a 
single  aost  preferred  alternative  before  I  Mould 
feel  that  I  could  select  the  best  alternative. 

R; 

-5 _ _ 0 _ i 

1  A  2  S  3  7 

J 
6 

indicating  strong  support  for  the  hypothesis.  A  recent 
evaluation  of  a  decision  support  systea,  designed  to 
provide  a  nondoalnated  set  of  alternatives  to  the 
decisionaaker  rather  than  a  single  aost  preferred 
alternative,  also  has  supported  this  hypothesis  (White 
et  el.,  1V82).  Therefore,  Me  conclude  ulth  high 
confidence  that  decision  situations  do  exist  uhere  the 
decisionaaker  dees  not  find  it  necessary  for  a  decision 
aiding  procedure  to  Identify  a  single  aost  preferred 
alternative. 

Hypothesis  S  6.  ARIADNE  Is  aore  useful  than  SMART 
In  situations  vhere  lnforaatlon  precision  Is  poor. 
Relevant  Questionnaire  questions  and  response: 


OR:  ARIADNE  is  aore  useful  than  SMART  In  situations 
vhen  lnforaatlon  precision  is  poor. 

R: 

-5 _ o _ ; _ s 

2  6  1 

5  7  3 

A 
8 

indicating  strong  support  for  the  hypothesis.  There 
uas  one  relevant  coaaent;  >8:  "Very  such  agree  that 
ARIADNE  aids  one  Mlth  iapreclse  data." 

Hypothesis  *  7.  Probleas  typically  encountered  in 
the  subject's  operational  envlronaent  Mould  be  aore 
appropriately  exaained  aided  by  ARIADNE  than  aided  by 
SMART.  Relevant  Questionnaire  questions  and  responses; 

Q10:  Typical  probleas  encountered  In  ay  operational 
envlronaent  would  not  be  appropriately  exaained  by 
SMART. 

013;  Typical  probleas  encountered  In  nv  operational 
envlronaent  would  not  be  appropriately  exaained  bv 
ARIADNE. 

Responses  to  010  and  Q13  were,  respectively: 

-3 _ 0__ _ 5 

5  2  6  A  3  1 

7 

8 


-3  0  3 

1  8  3  7  2 

A  3 
8 

These  responses  Indicate  a  perceptable  preference  for 
ARIADNE  over  SMART  In  the  subjects'  operational 
envlronaent . 

There  were  no  relevant  coaaents  to  Q10.  Relevant 
coaaents  to  Q13  were:  S2:  "My  operational  envlronaent 
does  not  Involve  actual  declsionswking  with  Multiple 
criteria.  It  does  Involve  analysis  of  the 
decislonaaking  of  other  partles/governaents."  S8: 
"Much  of  the  data  I  deal  with  are  abstract."The 
response  of  (and  further  discussions  with)  S2  Indicate 
a  need  for  an  inverse  decision  aiding  procedute  that  Is 
not  currently  operational  with  ARIADNE.  The  response 
of  S8  to  010  and  013  Indicate  that  "abstract"  appears 
to  be  svnonyaous  with  "vague"  and  "Iapreclse." 

We  conclude  that  ARIADNE  appears  to  be  soaewhat 
better  suited  than  SMART  for  the  operational 
envlronaents  of  the  subjects.  We  suspect  that  this  Is 
In  large  part  due  to  the  fact  that  hypothesis  *  6  had 
strong  support. 

Hypothesis  »  8:  ARIADNE  Is  no  aore  difficult  to 
understand  and  use  than  SMART.  Relevant  Questionnaire 
questions  and  responses: 

Oil:  SMART  Is  easier  to  use  than  is  ARIADNE. 

013:  ARIADNE  Is  easier  to  understand  than  Is  SMART. 

Responses  to  Oil  and  013  were,  respectively: 

-  0  3 

1  6  3  7  2  A  3 

8 

'  -3  0  3 

3  2  1  7 

AS  3 

These  responses  indicate  that  SMART  is  both  elapler  to 

use  and  easier  to  understand  than  ARIADNE  and  therefore 
contradict  the  validity  of  the  hypothesis. 

Relevant  coaaents  to  011  were:  S6:  "Since  one  has 
to  be  specific  in  SMART,  It  Is  harder  to  feel 
comfortable  with  the  exactness  of  the  results."  S8; 
"ARIADNE  does  require  soae  knowledge  of  computers  and 
the  Manipulation  of  data  on  thea;  or  at  least  one  that 
can  operate  a  computer."  The  S6  response  appears  to 
support  hypothesis  f  2. 

The  relevant  coaaent  to  Q13  was:  S6:  "Since  an 
"Interpreter"  was  used,  isost  Interfacing  problems  were 
eliminated." 

The  above  data  Indicate  that  the  evaluation  results 
contradict  the  veracity  of  this  hypothesis.  However, 
there  are  two  factors  which  contribute  to  this 
contradiction  that  could  possibly  be  eliminated  or 
mollified.  First,  many  of  the  subjects  had  had 
previous  experience  with  SMART-1  Ike  scoring  and 
weighting  assessment  procedures,  and  none  had  had  any 
experience  with  ARIADNE.  Second,  current  facilitation 
procedures  for  ARIADNE  are  more  complicated  and  less 
established  than  those  for  SMART.  We  conjecture  that; 
1.  a  sufficient  amount  of  familiarity  with  the  aore 
general  model  of  parameter  value  incorporated  Into 
ARIADNE  and  2.  the  completion  of  truly  established 
facilitation  procedures  for  ARIADNE  (as  exist  for 
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SMARTjwould  cause  ARIADNE  to  be  perceived  by  the  ueer 
••  no  aore  difficult  to  understand  and  use  than  SMART. 
Training  tlae  to  achieve  auch  a  level  of  faailiarlty  la 
likely  to  be  longer  for  ARIADNE  than  for  SMART  due  to 
the  relative  Increased  flexibility  inherent  in  ARIADNE. 

other  ieeuea  and  associated  oueetlone : 

OIS:  Was  the  posterior  sensitivity  analysis  associated 
with  SMART  helpful? 

Old:  Which  decision  asking  scenario  was  the  aost 
appropriate  for  ARIADNE  and  why? 

O''  Which  decision  asking  scenario  wos  the  aost 
appropriate  for  SMART  and  why? 

Responses  to  018,  Q19,  and  020  Indicated,  respectively, 
that:  1.  the  post-opt i aal it v  sensitivity  analysis 

feature  ms  useful  in  SMART,  2.  the  proposal  evaluation 
scenario  was  aost  appropriately  evaluated  using 
ARIADNE,  and  3.  the  sport  :  car  selection  scenario  was 
aost  appropriately  evaluated  using  SMART.  Generally, 
coaaents  to  questions  P  19  and  #  20  indicated  that  for 
coaplex  decision  selection  situations  having  less 
quantitative  lnforaatlon  available,  ARIADNE  would  be 
preferred  to  SMART,  which  provides  further  support  for 
hypothesis  f  6. 

Suaaarv.  our  evaluation  lends  credence  to  the 
following  claias: 

1.  The  aorc  flexible  model  of  parameter  value 
description  employed  by  ARIADNE  tends  to  reduce 
assessment  stress  and  makes  ARIADNE  wore  useful 
than  SMART  In  situations  where  Information 
precision  la  poor. 

2.  The  iterative,  progressive  lnforaatlon 
requirements  associated  with  ARIADNE  la  a 
desirable  feature  that  allows  earlier  presentation 
of  initial  alternative  ranking  lnforaatlon  than 
does  SMART. 

2.  SMART  requires  less  total  time  for  use  than  does 
ARIADNE. 

A.  Being  able  to  adjust  the  mix  of  alternative 
ranking  specificity  and  parameter  value  precision 
is  a  desirable  feature  of  ARIADNE. 

5.  ARIADNE  asy  require  more  training  than  would  be 
required  by  SMART  for  successful  use. 


8.  The  level  of  confidence  In  the  output  of  ARIADNE 
and  SMART  appear  to  be  quite  similar. 
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Abstract 


-^The  introduction  of  decision  aids  and  knowledge- 
based  expert  systems  incurs  resistance  when  non-congen- 
lal  styles  of  problem  solving  are  imposed  on  users.  On¬ 
going  research  addresses  the  design  of  computer-based 
display  and  analysis  systems  which  cater  flexibly  to 
personal  styles  while  providing  non-obcrusive  safe¬ 
guards  against  potential  errors  and  biases.  Capabili¬ 
ties  which  permit  monitoring  of  the  user's  task  by  the 
computer  and  of  the  computer  by  the  user  have  been  ex¬ 
plored.  ^ 

The  Problem 

High-level  users  of  computer-based  information 
systems  typically  find  that  either  too  little  or  too 
much  help  is  offered. [1] [2 ]  On  che  one  hand,  sophisti¬ 
cated  systems  are  available  for  data  retrieval,  analy¬ 
sis,  and  display,  yet  they  provide  little  guidance  in 
selecting  the  information  that  ought  to  be  retrieved  or 
the  type  of  analysis  which  the  user  ought  to  apply.  On 
the  other  hand,  decision  aids  and  knowledge-based  ex¬ 
pert  systems  typically  impose  an  analytical  structure 
and  mode  of  interaction  which  may  prove  inappropriate 
or  uncongenial  to  the  user's  own  preferred  style  of 
problem  solving.  Users,  in  short,  are  caught  between 
systems  that  automate  routine  functions  and  systems 
which  cannot  help  but  dominate  any  dialogue  with  the 
decision  maker. 

It  might  be  thought  that  as  computer-based  systems 
more  completely  automate  intellectual  tasks,  the  issue 
of  user  preferences  will  become  moot.  Yet  the  most 
critical  characteristic  of  these  new  applications  is 
that  they  are  neither  fully  objective  nor  demonstrably 
optimal.  Knowledge-based  expert  systems  incorporate 
the  assumptions  and  modes  of  reasoning  employed  by  hu¬ 
man  specialists.  Decision-analytic  aids  provide  logical 
constraints  for  inputs  from  human  experts  or  decision 
makers  regarding  subjective  probabilities,  preferences, 
and  problem  structure.  Both  kinds  of  systems  are  appro¬ 
priately  regarded  only  as  fallible  advisors.  Complete 
automation  could  be  inappropriate  if  users  possess  sub¬ 
stantive  expertise  or  analytic  insights  not  incorporated 
in  the  computer. 

What  is  required,  both  to  encourage  user  acceptance 
and  to  enhance  aid  performance,  is  a  repertoire  of  tech¬ 
niques  for  blending  the  expertise  of  the  user  and  com¬ 
puter.  Such  techniques  must  be  fine-grained  and  flexi¬ 
ble  enough  to  capture  shifting  availabilities  of  human 
and  computer  resources,  relative  levels  of  expertise, 
and  user  preferences. 

Unfortunately,  in  the  design  of  systems  that  foster 
cognitive  collaboration,  two  basic  objectives  tend  to 
conflict:  On  the  one  hand,  we  want  to  exploit  user  in¬ 
puts  where  (and  only  where)  they  can  enhance  the  over¬ 
all  credibility  of  aid  outputs.  On  the  other  hand, 
users  have  their  own  preferences  and  styles  of  problem 
solving  that  may  not  correspond  to  optimal  patterns  of 


allocating  cognitive  effort.  By  imposing  a  rigid  struc¬ 
ture  on  person-machine  interaction  (however  ''optimal'' 
it  may  be  from  the  point  of  view  of  relative  expertise), 
the  net  outcome  may  be  less  effective  problem-solving — 
including  perhaps  a  failure  to  use  the  system  altogether. 

To  deal  with  these  conflicting  objectives,  our  re¬ 
search  has  focused  on  three  broad  capabilities  in  cog¬ 
nitive  system  design: 

•  flexible  blending  of  computer  aid  human  contri¬ 
butions,  under  the  personal  control  of  the  user; 

e  monitoring  by  the  computer  of  selected  human- 
performed  tasks;  and 

a  monitoring  by  the  human  of  selected  computer- 
performed  tasks.. 

The  first  principle  maximizes  che  tailoring  of  person- 
computer  interaction  to  the  particular  style  of  a  user. 
The  second  and  third  principles  provide  a  prescriptive 
counterbalance:  they  are  designed  to  compensate  for 
deviations  from  optimality  that  may  emerge  from  the 
first  principle,  and  to  do  so  in  the  most  non-obtrusive 
way  possible. 

In  the  following  sections,  we  briefly  summarize 
some  of  the  research  we  have  done  under  these  three 
headings.  The  focus  is  on  the  psychological  underpin¬ 
nings  and  implications  of  the  work,  rather  than  on  the 
details  of  the  decision  aids  that  have  been  developed. 
This  work  has  been  supported  by  the  Engineering  Psycho¬ 
logy  Group  of  the  Office  of  Naval  Research  under  two 
on-going  contracts.* 

Aids  for  Personalized  Decision  Making 

Under  the  Defense  Department's  Small-Business  Ad¬ 
vanced  Technology  (DJSAT)  program,  DSC  has  explored 
the  design  of  a  computer-based  display  and  analysis 
system  which  is  customized  to  the  personal  cognitive 
styles  of  users. [3]  The  design  process  has  drawn  on 
relevant  work  in  the  cognitive  psychology  of  judgment 
and  choice,  in  computer  science,  and  in  the  prescrip¬ 
tive  theory  of  decision  making.  A  prototype  system, 
developed  for  attack  submarine  antisubmarine  warfare 
(ASW) ,  is  based  in  part  on  our  own  study  of  individual 
differences  in  decision-making  styles  among  submarine 
officers. 

The  Decision  Sotting  and  the  Decision  Process 

The  dilemmas  faced  by  the  command  staff  of  a  hunt¬ 
er-killer  submarine  in  approaching  and  attacking  an  (as 
yet)  unalerted  hostile  submarine  are  characteristic  of 
situations  involving  stealth’ in  warfare:  How  long 
should  I  attempt  to  remain  undetected  and  to  improve  my 
position,  before  I  tip  my  hand  by  launching  a  weapon? 

In  planning  an  attack,  the  Commander  faces  a  number  of 
choices  (among  weapons,  targets,  approach  maneuvers, 
and  times  of  fire)  and  is  flooded  with  an  increasingly 
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unmanageable  quantity  of  data  (about  the  target,  own 
hip,  and  environment).  To  capitalize  on  the  element  of 
surprise,  a  price  must  be  paid  in  the  quality  of  the 
lata,  the  complexity  of  options,  and  the  strenuousness 
>f  the  choice  process.  In  -11  of  these  areas,  there  is 
iubstantlal  leeway  for  differences  in  individual  cogni¬ 
tive  styles  of  coping. 

Situation  Assessment.  Assessment  tasks  must  de- 
>end  almost  exclusively  on  passive  sensors  (which  do 
lot  alert  the  enemy);  as  a  result,  data  are  often  frag- 
nentary,  noisy,  and  inconsistent.  Little  or  no  guid- 
ince  is  provided  in  reconciling  multiple  conflicting 
sstimates  of  the  same  variable  (e.g.,  target  range), 
organizing  data  acquisition,  assessing  the  quality  of 
estimates,  or  drawing  inferences  about  critical  oppor¬ 
tunities  and  dangers  (e.g.,  probability  of  kill,  pro¬ 
bability  of  counterdetection). 

Work  in  cognitive  psychology  suggests  a  number  of 
ways  in  which  people  may  simplify  the  cognitive  demands 
of  these  tasks  at  the  risk  of  suboptimal  performance. 
Where  multiple  estimates  are  available  for  a  single  var¬ 
iable  (e.g.,  target  range),  people  tend  to  Ignore  evi¬ 
dence  that  contradicts  a  favored,  or  earlier,  datum  and 
to  double  count  redundant  evidence. [A]  Patterns  of 
information  search  tend  to  avoid  stringent  tests  of 
favored  hypotheses. (5] [6] [7]  Assessments  of  degree  of 
certainty  tend  to  be  overconfident. [8]  When  inference 
proceeds  in  stages  (e.g.,  deriving  probability  of  kill 
from  information  about  range,  which  is  derived  from 
bearings  data),  people  often  act  as  if  conclusions  at 
earlier  stages  were  known  to  be  true,  rather  than  merely 
inferred. [9]  Similarly,  the  probability  of  a  detailed 
scenario  is  often  judged  higher  chan  the  probabilities 
for  component  events. [10] 

Option  Generation.  Interdependent  elements  of  a 
tactic  should  be  considered  together:  for  example,  use 
of  certain  types  of  weapons  may  be  precluded  by  the 
risk  of  counterdctectlon  by  a  third  party  threat,  unless 
appropriate  maneuvers,  firing  position,  and  time  of  fire 
are  selected.  The  consequences  of  immediate  decisions 
for  later  choices  may  also  be  critical  -  e.g.,  the 
ability  to  proceed  against  or  evade  a  second  threat 
after  the  initial  attack,  or  the  ability  to  respond  if 
unexpectedly  counterdetected. 

Research  suggests  that  the  process  of  formulating 
options  is  often  truncated  in  a  variety  of  ways.  People 
prefer  to  treat  the  elements  of  complex  options  as  if 
they  were  Independent  choices.  There  is  a  tendency  to 
formulate  options  that  span  only  a  short  time-frame, 
and  to  overlook,  as  a  result,  the  cumulative  risk  of 
pursuing  a  gl'-en  course  of  action  over  a  long  period  of 
time. [11]  Individuals  differ  in  the  degree  to  which 
future  acts  are  considered  lu  current  planning  [12]  and 
in  the  sheer  number  of  options  they  consider. [13]  Cus¬ 
tomary  way3  of  viewing  a  problem  tend  to  hinder  the 
generation  of  novel  and  creative  solutions. [14] 

Choice.  The  aim  of  avoiding  counterdetection  fre¬ 
quently  clashes  with  other  goals.  A  premature  attack 
may  both  alert  the  enemy  and  miss;  yet  continued  ap¬ 
proach  Increases  the  risk  that  own  ship  will  be  detected 
before  attack  or  that  the  target  will  successfully 
evade.  Perhaps  because  the  information  load  tends  to 
be  large,  simple  heuristic  decision  rules  are  often 
invoked:  e.g.,  for  time-of-fire,  "avoid  counterdetec¬ 
tion";  or  "fire  as  soon  as  within  maximum  weapon  range 
and  in  possession  of  a  range  solution." 

There  is  a  growing  literature  iu  cognitive  psycho¬ 
logy  suggesting  that  rules  like  these  may  be  adopted  to 
reduce  the  cognitive  effort  that  would  be  Involved  in  a 
thorough  consideration  of  each  option. [IS] [16]  With  one 


such  rule,  Elimination-by-Aspects  [17],  all  options  fal¬ 
ling  below  a  cutoff  on  an  attribute  are  eliminated,  and 
attributes  are  considered  in  turn  until  only  one  option 
remains.  In  "satisficing"  [IS] [19],  the  decision  maker 
considers  a  sequence  of  options  but  stops  as  soon  as 
he  finds  one  that  clears  a  cutoff  or  set  of  cutoffs  on 
selected  attributes.  In  each  of  these  examples,  an 
option  might  be  eliminated  even  though  it  scores  very 
high  on  some  dimensions.  In  the  submarine  context, 
such  rules  exclude  a  balancing  of  tradeoffs  -  such  as 
accepting  a  small  risk  of  detection  in  order  to  accomp¬ 
lish  a  mission  objective.  Satisficing  can  cruse  super¬ 
ior  options  (e.g.,  a  later  time  of  fire)  to  be  over¬ 
looked. 

It  has  been  suggested  that  experts  differ  from 
novices  in  their  capability  to  individually  recognize 
a  very  large  number  of  different  problem  situations. 

[20]  Klein  [21]  argues  that  experts  tend  to  reason 
holistically,  by  analogy  with  previous  similar  exper¬ 
iences,  rather  than  by  analysis  and  computation.  To 
the  extend  that  this  is  true,  we  might  expect  that  for 
experienced  commanders  all  stages  of  decision  making- 
situation  assessment,  option  generation,  and  choice— 
would  be  considerably  streamlined.  At  the  least,  we 
would  expect  decision  makers  to  differ  in  the  degree  to 
which  they  arrive  at  highly  integrative  conclusions 
without  the  necessity  of  a  large  number  of  explicit 
intervening  steps. 

Individual  Differences  in  Decision-Making  Style 

! 

Early  in  the  design  process  of  the  prototype  aid, 
data  regarding  individual  patterns  in  the  use  of  infor¬ 
mation  was  gathered  in  a  procedure  involving  four  form¬ 
er  submarine  command  personnel.  They  received  a  ques¬ 
tionnaire  describing  a  realistic  multiple  threat  ASW 
approach  and  attack  scenario.  The  questions  were  de¬ 
signed  to  focus  not  only  on  observable  patterns  of  in¬ 
formation  use,  but  also  on  the  less  conspicuous  deci¬ 
sion-making  processes  within  which  that  information 
plays  a  role.  At  each  of  a  number  of  break  points  in 
the  scenarios,  the  officers  were  asked  to  specify:  the 
information  currently  available  on  board  the  submarine 
which  they  would  seek,  the  source  from  which  they  would 
seek  It,  the  combat  decisions  that  depended  on  the 
Information,  the  way  the  information  would  affect  those 
decisions,  and  the  objectives  of  the  decision. 

Analysis  of  thlB  data  suggested  that  there  were 
important  differences  in  styles  of  data  gathering,  op¬ 
tion  formulation,  and  choice  to  which  an  aid  might 
caters 

Situation  Assessment  (A):  Amount  of  Information. 
The  total  number  of  items  ucilized  varied  considerably, 
from  42  information  requests  by  one  officer  to  18  by 
another. 

Situation  Assessment  (B):  Information  Search 
Pattern.  Requests  for  data  fell  into  two  quite  dis¬ 
tinct  patterns.  Two  of  the  officers  tended  to  organize 
data  acquisition  by  source,  asking  for  a  "dump"  of  cur¬ 
rent  estimates  from  sonar,  plot,  or  fire  control,  then 
going  on  to  another  source.  The  other  two  officers 
organized  data  acquisition  by  item,  asking  fo.'  a  given 
estimate,  like  target  range,  from  a  variety  of  sources 
or  else  selectively  requesting  different  items  from 
different  sources. 

Option  Generation:  Time-Span.  The  officers  dif¬ 
fered  in  the  time  horiztn  of  the  options  they  consider¬ 
ed,  e.g.,  focusing  exclusively  on  the  immediate  actions 
required  f>  regain  a  lost  contact  versus  evaluating  in 
advance  approach  tactics  contingent  upon  recovery  of 
the  contact. 
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Choice  (A) :  Level  of  Integration.  There  were  dif¬ 
ferences  among  officers  and  for  all  individual  officers 
across  situations  in  the  scope  of  the  objectives  which 
they  brought  to  bear  on  the  evaluation  of  options.  Ob¬ 
jectives  might  be  specified  quite  broadly  as  preserving 
own  ship,  ot  more  narrowly  as  avoiding  counterdetection 
or  watching  for  clues  regarding  counterdetection  status. 
Similarly,  the  goal  might  be  killing  the  target,  achiev¬ 
ing  a  suitable  firing  position,  or  opening  torpedo  tube 
doors. 


Choice  (B):  Number  of  Evaluative  Dimensions.  One 
officer  combined  concerns  for  own  ship  survival  and  kil¬ 
ling  the  target  in  all  decisions  (each  concern  might  be 
at  various  levels  of  Integration).  Two  of  the  officers 
appeared  to  shift  back  and  forth  in  their  focus  between 
these  concerns.  The  fourth  officer  went  all  out  for 
target  kill,  never  once  explicitly  mentioning  an  objec¬ 
tive  related  to  own  ship  survival  (at  any  level  of  inte¬ 
gration)  . 

Choice  (C):  Use  of  Cutoff  Criteria.  Three  of  the 
four  officers  evaluated  actions  explicitly  in  terms  of 
cutoffs.  All  three  used  the  achievement  of  maximum  wea¬ 
pon  range  as  a  criterion  for  attack;  one  used  arrival  at 
counterdetection  range  as  a  criterion  for  withdrawal. 

A  Prototype  Personalized  System 

A  prototype  personalized  aid  has  been  designed  and 
partially  Implemented  for  approach  and  attack  planning 
by  the  command  staff  of  a  nuclear  attack  submarine. 
However,  only  the  data  base  of  the  aid  is  affected  by 
the  nature  of  the  specific  application.  Its  functional 
logic,  and  the  methods  used  to  achieve  both  personaliza¬ 
tion  and  prescriptive  Impact,  are  quite  general.  The 
implementation  of  a  demonstration  prototype  system  in  a 
specific  context,  however,  permits  a  realistic  test  of 
the  feasibility  of  the  concepts,  with  potential  users. 


Figure  1  outlines  the  general  logic  of  the  cogni¬ 
tive  interface.  The  prototype  aid  design  consists  of  a 
data  base,  a  flexible  general-purpose  Planning  Module, 
and  four  relatively  specialized  routines  for  customiz¬ 
ing  the  aid.  The  system  utilizes  principles  of  spatial 
data  management  which  combine  an  undemanding  style  of 
interaction  with  a  high  degree  of  user  control  over  dis¬ 
play  contents.  All  user  inputs  are  via  a  single  simple 
locator  device  (a  joystick  plus  button)  with  control 
properties  that  shift  appropriately  with  the  display  re¬ 
gion  where  the  cursor  is  located. 


DATA  BASE 
(Store  basic 
Inputs,  fore¬ 
casts,  eval¬ 
uations,  ac¬ 
tion  alter¬ 
natives) 


vioure  1.  Structure  of  Prototype  Aid 
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The  display  area  of  the  Planning  Module  (Figure  2) 
is  divided  into  a  set  of  windows  which  permit  simultan¬ 
eous  viewing  of  substantive  results  (evaluations  of 
alternative  tactu.cs)  and  a  variety  of  menus  by  means 
of  which  the  user  can  specify  the  tactics  to  be  evalu¬ 
ated,  the  criteria  to  be  employed  in  the  evaluation, 
and  souices  of  validation  for  displayed  results.  A 
final  menu  enables  the  user  to  select  other  specialized 
modules  (Select,  Adjust,  Alert,  Advisory).  The  Plan¬ 
ning  Module  facilitates  a  variety  of  personal  prefer¬ 
ences  in  the  approach  to  situation  assessment,  formu¬ 
lation  of  options,  and  choice. 


ADJUST 

ALERT 

SELECTION 

ADVISORY 


CNTRDETECTION 

j FIRST  SHOT  KILL 

LOC/MOT 

TGT  VALUES 

KILL 

THERMAL 

0/S  WEAPON 

TGT  LOC/MOT 

TGT  CLSFN 

TGT  SENSORS 

TGT  MANEUVER 

Figure  2.  Planning  Nodule  Display 

Situation  Assessment.  The  data  base  consists  of 
basic  Inputs  (in  the  submarine  testbed  these  concern 
own  ship,  contacts,  and  the  environment)  together  with 
a  set  of  prescriptive  models  which  aggregate  those  In¬ 
puts  into  high-level  Inferences  and  forecasts  of  criti¬ 
cal  events  (e.g.,  counterdetection  and  first-shot  kill) 
and  evaluations  in  terms  of  ultimate  combat  objectives. 
The  Planning  Module  enables  users  to  sample  information 
at  any  preferred  level  of  aggregation  in  the  data  base. 
When  higher-level  Inferences  are  displayed,  the  Plan¬ 
ning  Module  clearly  distinguishes  conclusions  from 
evidence,  and  indicates  the  sources  from  which  each 
inference  is  derived.  The  user  may  elect  to  examine 
in  more  detail  any  of  the  evidence  utilized  in  deriv¬ 
ing  a  particular  conclusion. 

The  Selection  Module  allows  the  user  to  view  a  map 
of  the  total  data  base  and  to  personally  select  the 
portion  which  will  be  immediately  accessible  through 
the  Planning  Module. 

The  Adjust  Module  enables  the  user  to  insert  sub¬ 
jective  judgments  in  pi nee  of  default  values  at  any 
level  in  the  data  base.  The  Planning  Module  will  then 
display  the  implications  of  the  hypothetical  or  revised 
values  for  any  higher-level  Inference.  (Default  values, 
however,  continue  to  be  stored  and  displayed.)  The  Ad¬ 
just  Module  thus  accommodates  individual  differences 
in  beliefs  and  preferences  and  -  from  a  prescriptive 
point  of  view  -  adds  a  potentially  valuable  source  of 
information  (the  user)  to  the  data  base.  We  return  to 
this  feature  in  the  last  section. 

The  Alert  Module  performs  situation  monitoring  for 
the  user.  It  enables  him  to  set  a  cutoff  or  threshold 
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for  any  variable  in  the  data  base  (at  any  level  of  aggre¬ 
gation)  when  cutoffs  are  crossed. 

Option  Generation.  The  Planning  Module  facilitates 
the  formulation  and  evaluation  of  complete  tactical  op¬ 
tions  (weapons,  targets,  approach  maneuvers,  and  times 
of  fire).  It  enables  the  user  to  vary  the  number  of 
alternatives  examined  and  the  time  into  the  future  over 
which  an  option  extends.  A  version  of  the  aid  curreutly 
under  development  gives  the  user  a  choice  between  enter¬ 
ing  his  own  options  directly  for  evaluation  or  specify¬ 
ing  personalized  parameters  to  constrain  automatic  op¬ 
tion  generation. 

Choice.  In  the  Planning  Module,  the  user  can 
evaluate  options  by  reference  to  objectives  at  any  of 
a  variety  of  levels  of  integrative  scope  (e.g.,  how 
quickly  will  the  option  get  me  to  point  x?  How  will  it 
help  improve  probability  of  kill?  What  is  its  overall 
merit,  combining  probability  of  kill  and  probability  of 
own  ship  survival?). 

The  Alert  Module  facilitates  Individual  heuristic 
strategies  (such  as  Elimination-by-Aspects  and  satis¬ 
ficing)  which  evaluate  actions  by  reference  to  cutoffs 
as  opposed  to  tradeoffs.  After  the  user  sets  a  thresh¬ 
old  on  a  variable,  the  Planning  Module  forecasts  whether 
or  not  the  threshold  is  expected  to  be  crossed  for  any 
action  alternative  which  he  wishes  to  evaluate,  and  if 
so,  when.  Different  heuristic  strategies  for  choice 
imply  differences  in  the  way  information  is  searched: 
e.g.,  by  action  (run  through  all  relevant  evaluative 
variables  for  a  given  tactic,  as  in  prescriptive  theory 
or  satisficing)  or  by  criterion  (examine  all  options  for 
a  given  evaluative  variable,  as  in  Elimination-by- 
Aspects).  In  che  Planning  Module  both  of  these  search 
■odes  ere  specifically  facilitated. 

Prescriptive  Prompting 

An  important  factor  in  designing  a  personalized 
and  prescriptive  aid  is  the  impact  of  Individual  pre¬ 
ferences  on  outcomes.  Simplifying  for  illustrative 
purposes,  alternative  strategies  for  performing  the 
same  task  may  fall  into  one  of  two  classes  in  this 
respect : 

•  Strategy  A  is  generally  expected  to  be  more 
accurate  or  yield  more  preferred  outcomes  than 
strategy  B,  but  requires  more  training,  more 
time,  and/or  draws  away  more  attention  from 
other  tasks. 

An  example,  in  the  area  of  choice,  might  be  evaluating 
each  option  by  reference  to  all  the  relevant  dimensions 
(A)  versus  eliminating  some  options  by  reference  to 
only  a  few  (B).  (Or,  in  Inference  tasks,  ignoring 
important  sources  of  uncertainty.)  In  these  cases, 
differences  among  people  in  preference  between  A  and 
B  might  reflect  differences  in  their  underlying  ability 
to  perform  A,  in  their  training  or  knowledge,  in  their 
handling  of  workload,  degree  of  motivation,  or  their 
evaluation  of  the  cost  of  errors, 

e  For  some  people,  strategy  A  is  expected  to  be 
■ore  effective  (better  in  accuracy,  payoffs, 
speed,  effort,  etc.)  than  strategy  B,  while 
for  other  people,  strategy  B  is  more  effective 
than  A. 

Payne  [IS]  speculates  that  search  organized  by  options 
versus  search  organized  by  attributes  may  reflect  indi¬ 
vidual  differences  in  the  way  knowledge  is  internally 
represented.  People  who  differ  in  their  degree  of  ex¬ 
perience  or  areas  of  expertise  may  prefer  and  benefit 
from  different  ways  of  structuring  a  problem. 


These  distinctions  have  implications  for  the  appro¬ 
priateness  of  prescriptive  advice  in  a  personalized  de¬ 
cision  aid.  In  the  second  case  discussed  above,  the 
user  usually  does  best  with  the  strategy  which  he  pre¬ 
fers;  accorditgly,  au  interactive  system  should  simply 
facilitate  selection  by  the  user  of  the  information  pro¬ 
cessing  rule  or  structure  to  be  employed. 

In  the  first  case,  the  computer's  role  may,  at  the 
request  of  the  user,  be  somewhat  more  active.  It  in¬ 
volves  an  apparent  conflict  between  the  user-preferred 
and  the  normative  strategy  -  though  the  use  of  the 
former  may  in  fact  be  well  justified  by  savings  in  time 
and  effort.  In  such  cases,  the  computer  can  assist  by 
applying  a  prescriptive  model  to  the  problem,  in  para¬ 
llel  with  the  user's  own  effort  which  it  monitors.  The 
aid  may  then  advise  the  user  when  discrepancies  seem 
significant.  The  prescriptive  model  applied  by  the 
aid,  of  course,  has  no  automatic  claim  to  truth;  it 
takes  the  role,  rather,  of  a  "cooperative  adversary"  or 
"devil's  advocate."  It  enables  the  user  to  concentrate 
his  own  attention  selectively,  in  areas  that  he  regards 
as  critical,  while  notifying  him  when  other  Issues  seem 
worthy  of  attention.  Advisory  prompts  thus  complement 
the  freedom  of  individual  choice  granted  by  personaliz¬ 
ing  features;  they  encourage  flexibility  by  offering 
some  insurance  against  possible  pitfalls. 

Two  important  features  of  advisory  prompts  as  we 
seen  them  are  worth  stressing: 

•  The  objective  is  not  simply  to  alert  the  user 
whenever  there  is  some  difference,  however 
small,  between  his  judgment  and  the  output  of  a 
prescriptive  model.  Tne  difference  must  be 
large  enough  to  matter,  in  the  actions  to  be 
selected  and  in  their  expected  outcomes. [22] 

e  The  user  himself  determines  the  size  of  the  dis¬ 
crepancy  that  would  justify  a  prescriptive 
pronpt.  The  frequency  of  prompting  will  thus 
depend  on  his  own  informal  assessment  of  the 
value  of  his  time  and  effort  relative  to  the 
cost  of  errors.  The  Adjust  Module  of  the  per¬ 
sonalized  aid  enables  the  user  to  input  that 
judgment. 

Prompts  may  be  Introduced  to  assist  users  in  tasks 
of  situation  assessment,  option  generation,  and  choice. 
Our  current  research  involves  the  conceptual  design. 
Implementation,  and  testing  of  a  variety  of  such 
prompts. 

Situation  Assessment.  The  user  might  be  notified 
when  two  Information  sources,  both  of  which  are  regard¬ 
ed  as  credible,  have  contradicted  one  another.  He 
might  then  choose  to  Implement  prescriptive  procedures 
for  appropriately  readjusting  one  or  both  credibility 
assessments  downward.  A  prescriptive  prompt  might 
notify  him  on  future  occasions  when  either  of  the 
(partially)  discredited  sources  is  involved  in  an  iah- 
portant  conclusion. 

Advisory  prompts  might  signal  when  favored  infor¬ 
mation  search  patterns  seen  inefficient,  e.g.,  seeking 
additional  confirming  evidence  for  an  already  well- 
supported  hypothesis. 

Prescriptive  prompts  might  warn  users,  when  they 
estimate  or  subjectively  adjust  higher-level  inferred 
variables,  that  a  number  of  stages  of  uncertainty  must 
be  kept  in  mind.  The  same  type  of  caution  might  be 
appropriate  when  the  likelihood  of  a  compound,  or  con¬ 
junctive,  event  is  being  assessed. 

Option  Generation.  Short  range  planning  might  be 
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more  appropriate  in  some  situations  (e.g.,  where  feed¬ 
back  is  continuous  and  mistakes  can  be  easily  and  quick¬ 
ly  corrected),  while  long  range  planning  would  be  more 
suitable  in  others  (e.g.,  where  a  risk  appears  small  un¬ 
less  it  is  considered  cumulatively  over  the  long  run). 
Prompts  might  recommend  that  the  user  consider  a  shift 
in  time  horizon  under  appropriate  circumstances. 

A  variety  of  prompts  might  be  utilized  to  stimulate 
"creativity,"  or  th-  generation  of  novel  options.  The 
system  might  encourage  the  user  to  adopt,  hypothetical¬ 
ly,  a  new  "schema"  of  the  situation  by  questioning  his 
basic  assumptions  about  the  threat,  own  ship,  and  envi¬ 
ronment  -  especially  where  the  system  data  base  actually 
has  information  that  deviates  from  "normal"  conditions. 
Alternatively,  the  system  might  encourage  the  user  to 
better  delineate  the  space  of  options  by  generating  op¬ 
tions  tailored  to  single  objectives,  especially  objec¬ 
tives  not  so  far  considered  by  the  user. 


staff  in  balancing  and  integrating  the  diverse  sets  of 
relevant  information.  The  aid  displays  evidence  (i.e., 
particular  ranging  techniques  with  assessments  of  their 
quality)  as  well  as  conclusions  (a  single  best  guess  as 
to  target  range  together  with  an  interval  of  uncertain¬ 
ty).  This  aid  has  been  Implemented  for  testing  pur¬ 
poses  at  the  Naval  Underwater  Systems  Center  (NUSC- 
Newport). 

For  present  purposes,  two  critical  features  of 
the  aid  should  be  noted: 

e  It  can  operate  in  a  completely  automatic  mod-  . 

Default  estimates  of  pooling  parameters,  i.e.,  weights 
describing  the  precision  of  the  solutions  and  their 
correlations,  are  based  on  at-sea  exercise  data.  Ulti¬ 
mately,  default  parameters  will  be  contingent  on  a 
variety  of  environmental  and  threat  characteristics. 


Choice.  Advisory  prompts  might  signal  a  user  who 
is  employing  cutoffs  when  tradeoffs  bear  looking  into; 
in  particular,  where  tradeoffs  involve  evaluative  dimen¬ 
sions  he  has  not  as  yet  examined,  bore  generally,  the 
Planning  Module  might  monitor  the  user's  selection  of 
information  and  specification  of  options,  and  derive 
hypotheses  regarding  the  user's  decision  process  and 
conclusions.  The  user  would  be  advised  when  informa¬ 
tion  about  tactical  options  which  he  has  not  requested 
may  have  implications  for  choice  that  clash  with  the 
system-inferred  user  model. 

User  Override 

Ir.  a  personalized  decision  aid,  ultimate  control 
over  task  assignments  belongs  to  the  user.  Ue  have 
just  seen  how  this  flexibility  might  be  counterbalanced 
by  the  a.'u's  capability  to  monitor  the  user.  In  a  com¬ 
plementary  fashion,  the  user  might  quite  gladly  hand 
over  certain  tasks  to  the  aid.  retaining,  however,  the 
capability  of  monitoring  its  performance  and  interject¬ 
ing  his  own  judgments  where  he  deems  it  appropriate. 

In  a  second  project  for  ONE,  DSC  t  as  developed 
decision  aids  w  ich  can  ir corporate  both  objective  data 
and  subjective  judgment . (23] ( 24] f 25 ]  A  special  focus 
of  this  work  has  been  the  analysis  of  passive  sonar  data 
to  estinate  the  range  of  a  target  on  a  nuclear  attack 
submarine.  This  task,  logically,  should  be  Included 
within  the  situation  assessment  feature  of  the  attack 
planning  aiu  described  in  previous  sections.  In  par¬ 
ticular,  work  on  this  aid  has  shed  some  light  on  how 
the  Adjust  Module  might  be  utilized  to  facilitate  user 
inputs  into  an  otherwise  automatic  process. 

Problem  Setting 

Numerous  techniques  are  available  for  estimating 
target  range  -  based  or.  different  aspects  of  the  data 
(e.g.,  bearing,  intensity,  angle  between  direct  and 
reflected  sound  paths)  and  using  different  analytical 
tools  and  assumptions.  Typically,  since  their  sources 
of  error  are  both  pronounced  and  different,  they  pro¬ 
duce  quite  diverse  estimates.  Confronted  with  a  diver¬ 
gent  set  of  estimates,  the  commander  is  likely  either 
to  suspend  judgment  about  range  altogether  or  to  focus 
on  only  one  or  two  favored  techniques,  at  the  expense 
of  others  that  might  either  corroborate  or  contradict 
them.  Attack  may  be  needlessly  delayed  while  a  good 
solution  Is  improved,  or  be  launched  prematurely  based 
on  overconfidence  in  a  bad  one. 

Pooling  ..Id 


A  range  pooling  decision  aid  has  been  developed, 
utilizing  a  Bayesian  framework  to  assist  the  command 


•  The  user  can  interpose  his  own  assessments  In 
addition  to  or  in  place  of  default  estimates 
at  any  point  in  the  range  pooling  process. 

Preliminary  Testing  of  Interactive  Modes 

The  pooling  aid  has  been  tested  in  three  modes: 


(1)  totally  automatic  (default  weights), 

(2)  totally  subjective  (weights  supplied  by 
user),  and 

(3)  user  override  (default  pooled  solutions 
adjusted  by  user). 


Prerecorded  data  from  at-sea  exercises  were  used  to 
simulate  conditions  (2)  and  (3).  Recorded  command  staff 
estimates  ("system  solutions")  were  used  to  derive  sub¬ 
jective  weights  by  multiple  regression  of  command  staff 
estimates  on  the  particular  ranging  techniques.  Com¬ 
mand  staff  adjustment  of  default  pooled  solutions  was 
simulated  by  pooling  command  staff  estimates  and  de¬ 
fault  pooled  estimates. 


Figure  3  summarizes  the  results  of  this  test  for 
two  different  samples  of  Rangex  data: 


Parameter-estimation  sample 
Cross-validation  sample 


2.0 


Command 

Pooling 

Adjusted 

staff 

with 

default 

estimate 

subjective 

weights 

pooled 

solution 

Sample 

Size 

50#,57# 

78*',  95* 

501,55# 

Figure  3.  Ratio  of  Mean  Absolute  Error  (MAE)  for  Var¬ 
ious  Interaction  Modes  to  MAE  for  Default  Pooled 
Solution.  #  -  Rangex  1-78  data;  *  ■  Rangex  1-78  and 
1-79  data 
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Subjective  Pooling.  Pooling  with  subjective 
weights  was  super  !  it  in  accuracy  both  to  the  command 
staff  estimate  and  to  the  specific  ranging  techniques. 
Although  command  staff  estimates  were  superior  to  par¬ 
ticular  ranging  techniques,  the  superiority  of  pooling 
with  command  staff  weights  to  the  command  staff  itself 
suggests  that  toe  information  actually  available  to 
the  command  staff  was  not  being  optimally  utilized  by 
them. (2b]  These  results  would  occur,  for  example,  if 
the  command  staff  were  probabilistically  selecclng 
among  estimates,  with  probabilities  dependent  ct  their 
relative  accuracy,  rather  than  pooling. 

Automatic  Pooling.  Pooling  with  default  weights 
vas  more  accurate  than  pooling  with  subjective  weights. 
This  is  not  surprising  since  default  weights  were  opti¬ 
mized  for  the  type  of  data  Involved  in  the  test.  It  is 
at  least  possible  that  subjective  weights  would  outper¬ 
form  default  weights  in  situations  which  differ  sharply 
from  exercise  condltlonr. 

I'set  Over  tde.  The  most  accurate  result  was  ob¬ 
tained  in  the  third  condition,  where  objective  (default) 
data  and  subjective  Inputs  were  combined.  This  strongly 
suggests  that,  despite  ineffective  utilization,  coctmand 
personnel  have  access  to  relevant  information  not 
incorporated  in  the  pooling  aid. 

This  Information  can  be  tapped  without  burdening 
personnel  with  the  task  of  formally  pooling  estimates. 
Leaving  that  job  to  the  decision  aid,  appropriate  staff 
might  nonetheless  monitor  its  performance  and  make 
adjustments  when  they  observe  significant  discrepancies 
from  their  own  intuitive  solutions.  The  Alerting  Mod¬ 
ule  can  assist  in  this  monitoring,  by  alerting  staff 
when  default  pooled  range  estimates  or  Intervals  of 
uncertainty  fall  outside  a  user-specified  "plausible" 
region.  Quite  apart  from  any  enhancement  of  user 
acceptance,  our  data  suggest  that  Incorporation  of 
judgments  in  addition  to  objective  Jata  can  Improve  the 
quantitative  accuracy  of  aid  outputs. 

Other  Applications 

The  requirement  of  stealth  In  warfare  often  im¬ 
poses  a  severe  constraint  on  communication  among  friend¬ 
ly  units.  Coordination  can  be  achieved  by  prespecify¬ 
ing  courses  of  action,  but  at  the  expense  of  flexibil¬ 
ity.  The  combination  of  automatic  aid  functioning  and 
user  override  capability  offers  a  different  approach. 

It  may  be  applied  to  option  generation  (for  example, 
user  override  of  default  option  generation  settings) 
and  choice  (for  example,  user  override  of  default  eval¬ 
uations  of  (utenmes,  such  as  the  relative  worth  of 
different  tyocs  of  targets).  In  either  of  these  cases, 
default  settings  might  be  based  on  doctrine  or  mission 
directives;  the  provision  of  oveiride  will  then  set  a 
balance  between  central  guidance  and  flexible  response 
to  unique  circumstances. 

Conclusion 

Both  th •  attack  planning  aid  and  the  range  pooling 
eld  met  with  seme  ojcceos  in  initial  demonstra¬ 

tion-  wU.n  » eg  -li-.entflt  Ive  potential  users.  Nonethe¬ 
less,  mm  if  not  most  of  the  basic  ideas  presented  in 
this  recoil  recall  untested.  Careful  work  remains  to 
bt  cine  i.:i  delimiting  the  cognitive  structures  and 
mod? i  of  '  rocesaing  t.  which  aids  should  cater,  in  de¬ 
fining  arid  te.ing  1,0  i -ootrusive  prescriptive  prompts, 
in  Hev.tf/lng  non  -buroensome  methods  for  incorporating 
Judgment,  an  in  cyveJoping  guidelines  to  determine 
when  ana  f  ir  wt.ors  method*  of  the  sort  described  b  re 
are  a- prop:  laic.  The  hoped  for  benefits  include  noth 
increase!  user  acceptance  and  improved  system  ocrform- 


*  The  work  described  was  supported  by  the  Office  of 
Naval  Research,  Engineering  Psychology  Group,  under  Con¬ 
tract  Numbers  N00014-82-C-0138  (technical  monitor  J. 

O' Hare)  and  N00014-8Q-C-0046  (technical  monitors  M.A. 
Tolcott  and  G.S.  Maleckl),  with  the  collaboration  of  the 
Naval  Underwater  Systems  Center  (Code  35).  R.C.  Bromage 
R.V.  Brown,  J.O.  Chinnis,  L.  Merchant-Geuder,  J.W.  Payne 
and  R.  Parlsoau  have  contributed  in  a  variety  of  ways 
to  this  effort. 
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Abstract 

There  are  three  primary  methods  for  timely  information  transfer 
that  are  employed  in  the  Fleet  today.  These  consist  of  digital  data 
transfer  (such  as  Link  11),  record  message  traffic  (such  as  Fleet  Satel¬ 
lite  Broadcast  or  NAVMACS),  and  voice  communications.  With  the 
advent  of  command  facilities  with  advanced  display  and  processing 
capabilities,  there  evolves  a  fourth  method  for  information  transfer: 
command  summary  display.  The  summary  display  can  be  composed 
of  graphic  information  (such  as  Large  Screen  Displays)  or  alphanu¬ 
meric  information  (such  as  the  Automatic  Status  Boards).  It  has  been 
determined  that  in  many  situations,  the  Composite  Warfare  Com¬ 
mander  (CWC)  for  the  Battle  Group  requires  the  same  summary  dis¬ 
play  information  that  a  subordinate  warfare  commander  has  at  his 
disposal  in  another  command  facility.  Having  similar  information  dis- 
playrd  at  both  sites  enhances  coordination  between  the  commanders, 
and  transferring  this  information  in  the  form  of  summary  displays  can 
greatly  reduce  the  need  for  replication  of  special  purpose  processing 
and  data  bases.  While  the  transfer  of  summary  displays  can  greatly 
enhance  the  coordination  between  commanders,  it  also  places  a  heavy 
burden  on  link  communications.  Therefore,  it  is  imperative  that  only 
essential  information  be  transferred  and  that  compression  and  abbre¬ 
viated  representation  techniques  be  used  in  the  transfer  of  the  display 
information. 

(  N  K,  •  *  ' 

~'  J  This  paper  will  Casus  on  the  transfer  of  command  summary  dis¬ 
plays  between  command  facilities  to  enhance  warfare  commander 
coordination  and  the  requirements  that  these  display  transfers  place 
on  the  display,  decision,  and  communications  systems  that  are 
involved. 


1.  Intrpduffjqq 


Composite  Warfare  Commander  Concept 


Current  Navy  directives  provide  for  the  Commander  Battle 
Force/Group  at  his  discretion  to  employ  the  Comoosite  Warfare 
Commander  (CWC)  doctrine  in  command  of  his  forces. fl  1  The  use  of 
this  doctrine  under  control  of  the  Officer  in  Tactical  Command  (OTC) 
establishes  a  CWC  who  is  responsible  for  defending  the  force  from  air, 
surface,  subsurface,  and  other  types  of  threats.  When  necessary,  the 
CWC  will  designate  three  subordinate  warfare  area  commanders  who, 
when  delegated  authority,  coordinate  offensive  and  defensive  opera¬ 
tions  with  conflicts  being  resolved  by  the  CWC,  who  retains  control 
by  negation. 


Command  Support  Facility  Development 

The  Johns  Hopkins  University  Applied  Physics  Laboratory 
(JHU/APL)  under  the  Battle  Group  Anti-Air  Warfare  Coordination 
(BGAAWC)  Program  is  developing  the  Battle  Group  Anti-Air  Warfare 
Display  Group  (BADG)  to  determine  the  display,  decision,  and 
communication  requirements  of  the  Anti-Air  Warfare  Commander 
(AAWC).  The  BADG  serves  as  a  development  tool  for  the  design  and 


test  of  advanced  display  and  derision  aid  capabilities  for  the  AEGIS 
Display  System  (ADS)  to  support  the  AAWC  aboard  the  new  AEGIS 
Class  cruiser  (CG-47).  The  displays  of  BADG  and  ADS  comprise  four 
Large  Screen  Displays  for  graphic  presentation  of  track  data,  ten  CRT 
monitors  configured  as  Automatic  Status  Boards  for  display  of  sum¬ 
mary  alphanumeric  information,  and  multiple  other  special  purpose 
displays.  Once  initiated,  all  displays  can  be  automatically  updated 
from  the  data  base  by  the  BADG  processors. 

In  a  parallel  effort,  the  Navy  Ocean  Systems  Center  (NOSC)  is 
developing  the  Tactical  Flag  Command  Center  (TFCC)  and  associated 
equipment  to  support  the  Battle  Group  OTC  and  CWC  aboard  Naval 
aircraft  carriers  and  other  selected  combatants. 

With  continued  development  of  these  systems  it  has  been  recog¬ 
nized  that  there  is  an  imperative  need  to  determine  interoperability 
requirements  for  the  ADS  and  TFCC  in  order  to  fully  meet  their 
respective  Commander’s  requirements  for  coordination  and  exchange 
of  information.  Effort  was  initiated  at  JHU/APL  to  propose  a  set  of 
ADS/TFCC  interface  guidelines)2)  that  discussed  the  factors  that 
affect  the  interoperability  of  these  two  systems,  the  types  of  informa¬ 
tion  that  should  be  transferred  to  meet  AAWC  and  OTC/CWC  com¬ 
mand  and  coordination  requirements,^)  and  the  communications, 
processing,  and  display  systems  that  will  be  available  to  perform  such 
an  information  transfer. 

CWC  Concept  Premise 

The  development  of  interoperability  requirements  between  afloat 
command  facilities  is  dependent  on  the  extent  to  which  the  CWC  con¬ 
cept  will  be  employed  in  the  Battle  Group  (BG).  In  order  to  develop 
capabilities  to  meet  the  most  stressing  requirements  for  coordination, 
the  following  assumptions  were  made.  First,  the  location  of  the  war¬ 
fare  commanders  on  different  platforms  intensifies  coordination 
requirements.  Second,  full  delegation  of  responsibilities  by  the  CWC 
to  the  warfare  commanders  is  the  most  stressing  case.  Third,  the 
maintenance  of  the  warfare  area  complete  track  picture  is  the  most 
communication  intensive  of  the  warfare  commander’s  responsibilities. 


II,  Command  Facility  Capability 
Assumed  Capability 

In  general,  the  command  facilities  being  developed  to  support 
the  CWC  and  his  subordinate  warfare  commanders  provide  similar 
capabilities  in  terms  of  communications,  display,  data  base,  and 
processing,  although  the  methods  of  implementation  may  differ  signi¬ 
ficantly.  In  developing  the  interoperability  requirements,  the  follow¬ 
ing  assumptions  were  made  with  respect  to  the  command  facilities 
capabilities. 

Each  command  facility  will  have  access  to  the  full  set  of  tactical 
digital  data  links  and  satellite  links  that  are  available  to  a  surface 
combatant.  Figure  1  shows  the  1990  communications  environment 


for  the  BADG/ADS  configuration  that  will  be  placed  on  the  AEGIS 
T1CONDEROGA  Glass  cruisers  to  support  the  AAWC.  Figure  2  shows 
the  corresponding  communications  environment  for  the  TFCC  that  is 
to  support  the  OTC/CWC  aboard  carriers  and  other  selected  com¬ 
batants.  The  full  complement  of  links  are  required  to  ensure  that 
both  sites  are  full  participants  in  all  coordination  and  reporting  nets 
of  the  BG  for  access  *o  tactical  sensor  data  and  exchange  of  command 
messages. 

All  BG  organic  and  non-organic  sensor  information  should  be 
available  at  each  command  facility.  This  implies  not  only  the  need 
for  access  to  external  communications  links  to  obtain  the  sensor  data, 
but  the  ability  to  store  and  process  the  track  information  as  well.  This 
requirement  is  particularly  important  to  the  warfare  commander,  who 
is  maintaining  the  BG  track  picture  for  his  warfare  area,  in  that  it 
eliminates  the  need  to  transmit  the  large  volume  of  track  information 
to  the  CWC,  since  the  CWC  already  has  similar  information.  Instead, 
the  warfare  commander  need  only  send  interpretative  information  of 
the  track  picture  such  as  groupings,  identification  resolution,  and  area 
boundaries,  thus  reducing  communications  loading. 

The  command  facilities  should  have  the  capability  to  present 
Automatic  Status  Board  (ASTAB)  information  which  consists  of 
alphanumcrics  on  a  CRT  display  and  to  present  graphics  and  alpha- 
numerics  on  a  Large  Screen  Display  (LSD).  These  are  required  to 
display  the  BG  status  information  and  track  picture  respectively. 

Finally,  it  is  necessary  that  each  command  facility  maintain 
technical  information  data  bases  (ship  characteristics,  sensor  perform¬ 
ance,  weapons  capabilities,  etc.)  so  that  the  bulk  of  this  information 
Joes  not  have  to  be  transmitted  between  facilities  when  referring  to  a 
track  with  a  particular  identification. 


III.  Command  Facility  Communication; 

Current  Methods 

There  are  three  primary  methods  for  timely  information  transfer 
between  command  facilities  in  the  Fleet  today.  The  fastest  method  is 
the  computer-to-computer  digital  data  link  such  as  Link. l’L  and  Link 
4A  that  is  used  to  carry  track  information  and  limited,  force  orders. 
The  method  most  commonly  employed  for  ilic  routine  message  traffic 
is  the  teletype  transmission,  which  is  usually  sent  overHF  circuits  or 
on  satellite  channels  such  as  Fleet  Satellite  Broadcast  and  NAVMACS, 
and  carries  record  message  traffic,  operational  orders,  summary  and 
status,  and  limited  contact  reporting.  The  most  heavily  depended 
upon  method  is  still  voice  communication  that  is  used  for  coordina¬ 
tion  and  almost  everything  else. 

Proposed  Future  Methods 

With  the  advent  of  the  advanced  display  and  processing  capa¬ 
bilities  in  the  command  support  facility,  the  potential  exists  for  „ 
fourth  method  for  information  transfer:  command  summary  display. 
The  summary  displays  that  are  obvious  candidates  are  the  alphanu¬ 
meric  ASTABs  and  the  graphics  LSDs.  In  addition,  operational 
orders,  summary  and  status,  and  contact  reports  that  are  manually 
entered  for  teletype  transmission  can  now  be  automated  with 
computer-aided  message  generation.  Advanced  command  facility 
processing  would  then  generate  selected  classes  of  messages  directly 
from  the  data  base  contents  and  would  prompt  the  operator  when 
necessary. 
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Figure  2.  TFCC  Communications  Environment. 


IV.  Summary  Display  Transfers 


Attributes 

It  has  been  determined  that  in  many  tactical  situations,  the  CWC 
for  the  Battle  Group  requires  the  same  summary  display  information 
that  his  subordinate  warfare  commanders  have  in  their  command 
facilities.  HI  For  example,  while  under  the  CWC  concept  the  warfare 
commander  m?y  be  delegated  the  responsibility  to  maintain  the  force 
track  picture  for  his  warfare  area  as  well  as  force  status  for  sensors, 
weapons,  and  engagements,  the  CWC  also  requires  this  information  to 
exercise  control  by  negation. 

Having  similar  summary  information  displayed  at  both  sites 
enhances  coordination  between  commanders  when  used  in  conjunc¬ 
tion  with  other  methods  of  communication  (such  as  a  voice  discussion 
of  a  track  picture).  Transferring  this  information  in  the  form  of 
summary  displays  can  greatly  reduce  the  need  for  replication  of 
special  purpose  processing  and  data  bases.  The  summary  displays  can 
be  sent  automatically  on  alert,  periodic,  event,  or  query  basis  by  the 
command  facility  processing  system.  This  method  of  information 
transfer  has  a  drawback  though,  in  that  it  does  not  facilitate  local 
storage  (other  than  snapshot)  or  processing  of  the  information  by  the 
receiving  system.  If  the  intent  is  to  performs  a  data  base  to  data  base 
transfer,  then  a  compu<er-to-computer  data  link  transfer  would  be 
much  more  efficient. 

While  the  transfer  of  summary  displays  greatly  enhances  coordi¬ 
nation  between  commanders,  it  can  also  exact  a  great  price  in  link 
communications  loading.  Therefore,  only  essential  information  should 
be  transferred,  and  techniques  for  compression  and  abbreviation 
should  be  used  to  lessen  the  communications  burden. 


Automatic  Status  Board  Transfers 

An  ASTAB  contains  up  to  16  lines  of  32  alphanumeric  characters 
in  multiple  adaptable  formats  that,  once  initiated,  are  automatically 
updated  and  maintained  by  the  processor.  In  order  to  transmit  the 
ASTAB  display,  a  Navy  character-oriented  message  format  called 
RAIN  FORM  GOLD  was  chosen  because  of  its  capability  to  handle 
free  text  messages.  The  display  contents  are  packed  in  the  message  in 
a  full  display  compressed  serial  format.  Figure  3  shows  an  example 
ASTAB  and  Figure  4  .hows  the  corresponding  RAINFORM  GOLD 
message.  In  this  example,  an  asterisk  (•)  followed  by  a  number  (n) 
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Figure  3.  Example  Automatic  Status  Board  (ASTAB). 
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Communications  Media 


HEADER 


ZYN-UUUUUIEOU 
(PRECEDENCEIIOTGIIEOL) 

(ORIGIN  ATORIIEOU 
(AOORESSEEHEOU 
(UNCLASSIFIEDIIEOL) 

MSGID/COMMANO/GOLD/SER  OSSS/JUNIEOU 
NARR/ASTB:FORCOMSATL_FORCE_COMMUNICATION_STATUS/(EOU 
/*f  UNKSM4  ST/TUS'4  EMCON'4JMD/IEOL) 

/' 7  JTIOS'14  GRN'7  U*«  GRN/IEOU 

1*1  UNK  TIM2  GRN*7  U*«  VEL/IEOU 

1*1  UNK  4A12  GRN'7  U*«  GRN/IEOU 

/•S  VOICE  NETS/IEOU 

1*7  AAW  CMO*12  GRN'7  U'S  GRN/IEOU 

/•7  AAW  RPT*12  GRN'7  U'S  GRN/IEOU 

1*1  UNK  COORD'S  GRN'7  U'S  GRN/IEOU 

/*SCAP  CNTR/IEOU 
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1*1  R3Y1S  GRN'7  U*«  VEL/IEOU 

1*7  N75*13  GRN'7  U*»  GRN/ENO/IEOLI 

ENOAT/DECL—31 _ DECEMBER _ 1986IEOU 

BTIEOMI 


Figure  4.  Example  ASTAB  RAINFORM  GOLD  Message. 

denotes  a  string  of  n  spaces.  This  serial  compression  technique  is 
economical  for  a  string  of  3  or  more  spaces  between  characters  and 
can  save  up  to  50  percent  in  message  length  for  displays  with  few 
characters  and  many  spaces. 


A  formatted  character  field  transfer  was  considered,  where  each 
command  facility  would  store  a  copy  of  all  possible  ASTAB  formats 
with  header  and  column  information,  and  only  tactically  significant 
information  fields  would  need  to  be  transferred.  This  format  was 
rejected  since  it  constrained  the  type  of  ASTABs,  reor'  -d  force 
ASTAB  configuration  control,  and  increased  processing  complexity. 

Large  Screen  Display  Transfers 


A  LSD  contains  graphic  and  alphanumeric  information  as  well  as 
multiple  special  symbols,  that  are  commonly  configured  to  represent 
force  track  plots.  The  RAINFORM  GOLD  message  format  was  also 
chosen  for  the  LSD  display  transfer.  In  this  case  the  graphics  informa¬ 
tion  is  represented  in  the  message  in  the  form  of  a  display  overlay. 
The  overlay  for  a  circular  boundary,  for  example,  is  described  by  the 
letters  “CIR”  followed  by  the  position  of  the  circle  center,  and  its 
radius.  Figure  5  shows  the  overlay  representation  for  a  line,  dashed 
line,  ciHe,  dashed  circle,  arc,  rectangle,  string  of  'ext,  and  a  symbol  in 
a  RAINFORM  GOLD  message. 

As  previously  mentioned,  the  information  transferred  from  a 
warfare  commander’s  LSD  track  plot  includes  only  interpretative 
information  and  special  tracks.  It  does  not  include  maps  and  track 
position  information  which  are  assumed  to  already  be  available  at 
both  sites.  Not  only  does  this  reduce  communications  loading,  it  also 
eliminates  interoperability  problems  due  to  displaying  map  informa¬ 
tion  through  different  types  of  map  projections  used  in  the  command 
support  systems  (mercator  versus  orthographic  projection). 


It  is  assumed  '.tit  the  summary  display  RAINFORM  GOLD 
messages  will  be  sen  within  die  BG  via  the  Officer  in  Tactical  Com¬ 
mand  Information  Exchange  System  (OTCIXS)  of  FLTSATCOM  or 
via  a  character-oriented  subchannel  of  the  Joint  Tactical  Information 
Distribution  System  (JTIDS).  It  is  important  that  multiple  paths  exist 
for  transmission  of  this  information  to  improve  survivability  and 
throughput. 


V.  Conclusion 

Communication  between  the  Battle  Group  CWC  and  subordinate 
warfare  commanders  through  summary  display  transfers  can  enhance 
coordination  between  commanders  when  used  in  conjunction  with 
other  communications  methods.  The  display  transfers  require  ad¬ 
vanced  processing,  data  base,  and  display  capabilities,  but  can  be 
performed  using  existing  Navy  message  formats  on  existing  and  near- 
future  communications  links.  The  transfer  of  graphics  displays 
requires  the  development  of  an  intersystem  graphics  language  such  as 
the  aforementioned  graphic  overlays. 

Future  efforts  in  this  area  include  examination  of  other  inter¬ 
system  graphics  languages  such  as  North  American  Presentation-Level- 
Protocol  Syntax  (NAPLPS),f+'  modeling  of  command  facility  display 
message  exchange  to  determine  communications  loading,  and  the 
actual  demonstration  of  command  summary  display  transfers  between 
the  BADG  facility  at  JHU/APL  and  the  TFCC  facility  at  NOSC. 
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Figure  5.  Large  Screen  Display,  RAINFORM  GOLD  Formats. 
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Existing  tactical  display/control  consoles  which 

are  located  in  Combat  Information  Centers  -f6ICe)0 _ 

aboard  Navy  vessels  do  not  take  full  advantage  of 
existing  hardware/software/human  factors  technology. 
In  addition  to  limitations  imposed  by  hardware  con¬ 
straints,  the  user-interface  software  imposes  a 
diff icult-to- learn  interface  upon  the  complex  task- 
demands  of  the  command  and  control  environment.  The 
Naval  Ocean  Systems  Center*''  C'OSC)  has  developed  a 
prototype  for  a  command/control  console  which 
features  many  design  advantages  in  comparison  to 
existing  consoles.  The  new  configuration  presents  a 
diverse  array  of  human-engineering  issues,  some 
specific  to  this  console  and  others  gsneric  to  all 
consoles.  An  overview  of  these  issues  and  relevant 
research  conducted  at  NOSC  is  presented  in  this 
paper. 

Introduction 

In  1978,  NOSC  began  conceptual  design  of  a  CIC 
console  which  was  originally  conceived  to  meet  the 
needs  of  light-surface  combatants  such  as  hydrofoils. 
The  applications  for  this  new  console  expanded  to  all 
Navy  combatants  as  the  design  was  determined  to 
alleviate  many  present  console  problems.  Specifi¬ 
cally,  these  problems  include:  High  power  consump¬ 
tion  (requiring  water  cooling) ,  high  cost,  heavy 
weight,  and  lack  of  flexibility/growth.  The  design 
approach  was  to  incorporate  current  computer  and 
display  technology  to  alleviate  these  problems.  The 
console  is  currently  called  the  Lightweight  Modular 
Display  System  (U(DS) ,  and  it  has  shown  to  be  very 
promising  in  terms  of  meeting  these  functional 
design  goals. 


Figure  1.  Lightweight  Modular  Display 
Prototype  Console 


UffiS  Hardware  Configuration 

The  present  console  configuration  consists  of 
two  primary  user-interface  units,  shown  in  Figure  1. 

A  single,  high-resolution,  monochrome,  15-inch  (diag¬ 
onal)  CRT  display  unit  and  an  11-  x  15-inch  high-reso¬ 
lution  digitizer  tablet  are  used.  The  display 
resolution  is  1024  x  1368  discrete  points  or  pixels. 
The  effective  viewing  area  is  8  x  10.75  inches. 

Figure  2  presents  the  general  display  format  seen  by 
the  operator.  The  center  8-  x  8-inch  area  is  used 
primarily  for  tactical/positional  and  tabular  data 
presentation.  The  display  area  on  the  right  euid  left 
margins  provide  information  such  as  control  labels, 
control  actuation  feedback,  system  status,  and  other 
prompts.  The  area  to  the  left  of  the  display  unit 
is  reserved  for  communications  equipment  and  analog 
display  controls. 

The  digitizer  tablet  has  discriminate  points 
at  .0,4-inch  separation  (250  per  inch)  in  both  X  and 
Y  dimensions.  It  responds  to  light  pressure  from 
either  a  finger  touch  or  stylus,  and  transmits  a 
digital  word  which  represents  the  centroid  of  the 
pressed  area.  A  cursor  is  presented  on  the  display 
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Figure  2 .  General  LMDS  Display  Format 
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corresponding  to  the  location  on  the  tablet.  Position 
is  updated  rapidly  (approximately  100  msec.)  such  that 
position  feedback  delay  is  not  perceivable.  There  are 
no  labels  printed  on  the  tablet  in  the  current  config¬ 
uration. 

A  Plexiglas  cover  is  placed  over  the  tablet  to 
provide  usable  space  for  the  placement  of  materials 
such  as  printed  messages  ana  notes.  A  movable  fold¬ 
down  keyboard  is  being  considered  which  would  store 
beneath  the  Plexiglas  cover  and  fold  down  over  the 
tablet.  Plungers  placed  beneath  the  keys  would  acti¬ 
vate  the  tablet.  Metal  overlays  which  would  help 
guide  finger  placement  on  the  tablet  are  also  being 
considered. 

Key  Human  Factors  Study  Areas 

The  interface  design  goals  for  LMDS  include  the 

following : 

1.  Improve  feedback  to  user 

2.  Reduce  user-memory  load 


typical  console  user  graduates  to  on-the-job  train¬ 
ing  where  he  acquires  a  working  knowledge  of  the 
system.  This  cycle  is  constantly  repeated  as 
personnel  are  transferred  or  leave  service.  These 
conditions  clearly  dictate  the  need  for  an  interface 
which  can  prompt  and  aid  the  novice  user,  while  pro¬ 
viding  flexibility  and  'shortcuts'  for  the  experi¬ 
enced  user.  Present  consoles  and  interface  software 
do  very  little  of  either. 

The  shipboard  environment  requires  that  equip¬ 
ment  meet  tough  militarization  requirements,  and 
that  noire,  vibration,  and  platform  movement  are  con¬ 
sidered  in  control/display  design.  Watch  lengths 
are  usually  4  hours,  but  may  be  longer ^  and  the  sea- 
state  may  impose  a  degraded  environment  for  control 
and  display.  These  constraints  must  be  considered 
together  with  the  precision  required  when  selecting 
input  devices  (i.e.,  fixed  or  movable).  Equipment 
and  communication  'nolle'  combine  to  produce  a  high 
level  of  ambient  sound.  Ambient  noise  will  limit 
the  effective  use  of  aural  feedback. 

Human  Engineering  the  IMPS  Configuration 


3.  Accommodate  different  user  skill  levels 

4.  Improve  data  filtering  capability 

5.  Improve  response  time/accuracy  for  track 
designation 

6.  Reduce  operator  errors 

7.  Reduce  operator  training  requirements 


A  comparison  of  user  tasks  and  information 
requirements  with  the  hardware  configuration 
provides  a  'shopping  list'  of  Human  Factors 
considerations  from  which  trade-off  studies  can 
be  generated.  Table  1  presents  a  list  of  the 
major  task  areas  and  a  subjective  rating  con¬ 
cerning  the  adequacy  of  the  display  and  tablet 
in  supporting  each  task  area.  Performance  within 
each  of  these  task  areas  is  under  continuing 
study  at  the  NOSC  LMDS  Laboratory. 


Meeting  these  design  goals  will  require  careful  study 
of  user  information  requirements,  user  tasks,  user 
skill  levels,  operational  environment  and  training 
methods. 

User  Regulrements/Taaks .  Since  the  LMDS  console  rep¬ 
resents  a  radical  departure  from  existing  consoles  it 
forces  the  designer  to  make  a  comprehensive  study  of 
the  interface  needs  for  all  command/control  tasks. 
User  information  needs  and  tasks  will  vary  consider¬ 
ably  across  the  personnel  structure  of  the  combat 
team.  Whereas  previous  consoles  were  primarily  used 
for  'input'  operators  (i.e.,  those  tracking  targets 
and  updating  tactical  data  bases) ,  future  consoles 
will  also  be  used  as  workstations  for  high-level 
decision-maker3.  In  addition,  many  of  the  tasks 
which  are  currently  done  manually  will  be  automated 
to  a  greater  extent  in  future  Combat  Direction 
Systems  (CDS).  Thus,  more  tasks  will  be  delegated 
to  a  supervisory  control  mode  with  the  operator 
requiring  frequent  updates  on  automated  task 
processing  with  the  capability  for  manual  override . 
Information  requirements  for  higher-level  decision¬ 
makers  will  be  increased  by  an  enlarging  of  the 
threat  area  and  sensor  ranges,  coupled  with  an 
increased  variety  of  countermeasures/weapons.  Users 
must  bo  able  to  cope  with  an  overwhelming  amount  of 
available  data  by  using  display  controls  th*t  allow 
for  data  filtering  of  positional  and  tabular  data. 

User  Skill3/Trainlng/Environment .  It  is  useful  to 
think  of  Navy  console  users  as  falling  along  a  con¬ 
tinuum  from  novice  to  expert  users .  Novices  are  new 
to  the  Navy  Tactical  Data  System  (NTDS)  by  virtue 
of  being  new  recruits  or  by  transfer  from  a  non-NTDS 
ship.  Typical  training  involves  a  3-  to  4-week 
•Input'  course  followed  by  a  3-  to  4-week  'User' 
course  for  higher-level  decision-makers.  Training, 
therefore,  can  be  described  as  a  quick  overview  of 
the  1000-plus  command  vocabulary  of  the  NTDS.  The 


Table  1:  Overview  of  the  LMDS  Configuration  Support 
for  Command  and  Control  Task  Areas 

LMDS  CONFIGURATION 


User  Tasks/ 

Job  Requirements 

Digitizer 

Tablet 

Single  15-in. 
Display 

Track  Designation 
Cursor  Positioning 

adequate 

adequate 

Alphanumeric  Data 

Input 

poor 

adequate 

Alphanumeric/Tactical 
Data  Display 

limited 

Menu/Function 

Selection 

adequate 

adequate 

Procedural/Interface 

Tasks 

software 

dependent 

software 

dependent 

Track  Designation/Cursor  Positioning.  The 
operator  must  be  able  to  designate  individual  tracks 
in  order  to  perform  some  function  (identification, 
interrogation,  data  amplification,  etc.).  He  must 
also  be  able  to  position  a  cursor  to  select  func¬ 
tions,  menu  items,  and  place  graphics  at  precise 
locations  upon  the  display.  These  tasks  are  per¬ 
formed  by  using  a  trackball  device  on  existing 
consoles.  A  recent  NOSC  study  compared  a  digitizer 
tablet  with  a  1:30  control/display  (C/D)  ratio  to  a 
trackball  device  for  a  cursor  positioning  task. {1] 
This  study  indicates  that  the  tablet  device  will 
yield  comparable  performance  to  existing  consoles 
for  cursor  positioning  task;. [2] 

Alphanumeric  Data  Input.  Requirements  for 
alpha  character  data  entry  vary  across  operator 
positions  and  ship  types.  For  example,  operators 
may  add  to  the  graphic  track  symbols.  Certain 
higher-level  AEGIS  ship-class  operators  may  enter 
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doctrine  data  thrc  .g,  a  3  x  6  button  array  of 
alphabet  character .  The  operator  oust  page  back 
and  forth  between  two  arrays  which  each  contain 
half  of  the  alphabet.  The  paging  has  shown  to  be 
cumbersome ,  but  practical,  for  minimal  data  entry. 
Indications  are  that  future  requirements  for  alpha 
data  Input  will  Increase  for  tasks  such  as  doctrine 
entry,  mission  planning  parameter  entry,  end  deci¬ 
sion-aid  use.  Numeric  data  are  entered  through  a 
numeric  keypad  called  the  Digital  Data  Entry  Unit 
(DDEU) .  This  device  is  used  for  a  variety  of  tasks 
as  tracks  are  commonly  referred  to  by  their  four¬ 
digit  identification  number.  The  digitizer  tablet 
will  present  special  problems  for  alphanumeric  input 
tasks  due  to  the  lack  of  tactile  feedback  and  limited 
possibility  for  auditory  feedback  due  to  background 
noise.  Tactile  feedback  may  be  provided  in  the  form 
of  'locator  bumps'  on  the  tablet;  however,  this  may 
aid  function  selection  tasks  much  more  than  data 
input  tasks.  Errors  of  character  emission  and  double 
entry  will  likely  be  much  higher  than  a  conventional 
keyboard.  Studies  are  planned  to  investigate  the 
need  for  a  fold-down  keyboard  for  alphanumeric  data 
input. 

Alphanumeric  and  Tactical  Data  Display.  Modern 
C2  systems  can  quickly  overload  operators  with  irrel¬ 
evant  and  densely  formatted  data.  A  basic  design 
question  is  how  many  displays  are  needed  and  how  to 
appropriately  format  and  combine  alphanumeric  and 
graphic  data.  Current  state-of-art  consoles  usually 
include  the  Planned  Position  Indicator  (PPI) , 
or  radar  screen  as  the  primly  display  supported  by 
an  adjacent  CRT  for  alphanumeric  data.  This  configur¬ 
ation  is  based  more  upon  engineering  feasibility  than 
consideration  for  what  the  operator  needs  to  perform 
his  job.  Digital  radar  and  emerging  scan  conversion 
techniques  make  possible  the  conversion  of  alphanu¬ 
meric  and  sensor  (tactical)  data  on  a  single  display. 
Techniques  such  as  windowing  and  scrolling/paging 
may  permit  a  single  display  to  provide  all  the  data 
needed  by  the  operator.  The  feasibility  of  a  single 
display  will  be  considered  in  future  NOSC  studies. 

A  recent  study  of  tactical  symbology  fonts  for 
raster  displays  established  a  baseline  size  for 
raster-drawn  symbology. [3]  A  current  NOSC  study  is 
evaluating  different  symbology  font  styles  which  are 
derived  from  this  baseline  font.  The  objective  of 
these  studies  is  to  produce  symbology  which  is 
'compact'  and  minimizes  display  clutter  while  being 
readable  at  adequate  error  levels. 

Menu  Formats/Function  Selection.  A  diverse 
array  of  design  questions  are  included  in  this  topic 
area.  First,  the  C/D  ratio  must  be  optimized  for 
fine  and  gross  cursor  positioning  tasks.  The  opti¬ 
mum  ratio  will  determine  minimum  tablet  size.  The 
menu  formats  and  method  of  function  selection  are 
related  issues  which  will  require  careful  study. 
Function  selection  methods  include:  single  action, 
single  action  with  software  delay,  double  action, 
multiple  actions.  Major  considerations  in  metnod 
selection  are  the  severity  of  an  input  error, 
desired  response  speed,  isolation  of  the  function  on 
the  display  page,  and  multiple  sequence  tasks.  The 
guiding  rule  for  method  selection  is  minimize  the 
number  of  alternate  methods  and  be  consistent  in 
their  application. 

General  Interface  Issues.  A  number  of  diverse 
issues  fall  under  the  general  category  of  tasks 
related  to  the  overall  operating  structure.  Feed¬ 
back  requirements  include:  input  verification, 
process  completion  verification,  atate-of-system, 
and  error  messages. [&]  User-feedback  has  been  most 
noticeably  neglected  in  present-day  Naval  combat 


systems.  Given  the  training  structure  defined 
earlier,  the  system  needs  to  accommodate  novice  and 
expert  users.  Tutorials,  training  modes,  and  menu 
by-pass  features  are  needed  for  various  user  skill 
levels.  Programmable  default  options  with  edit 
capability  should  be  available  to  users.  Users 
should  be  able  to  'verify  to  proceed'  rather  than 
be  required  to  fill  in  menu  selections.  Multiple 
default  lists  would  be  extremely  valuable  for  tac¬ 
tical  symbology  and  display  graphics.  The  menu  net¬ 
work  structure  must  be  compatible  with  user  expec¬ 
tancy  and  should  minimize  display  paging  and  search 
time  within  a  given  frame. 

Conclusions 

This  paper  has  touched  upon  a  number  of  human 
factors  issues  which  arise  in  the  design  of  a  multi¬ 
purpose  cosmand  and  control  console  for  emerging 
combat  direction  systems.  NOSC  will  be  conducting 
an  ongoing  research  program  to  address  a  variety  of 
interface  issues,  some  specific  to  the  current  U4DS 
project,  others  applicable  to  a  generic  console. 
Operator  needs  are  often  neglected  in  current 
command  and  control  interfaces,  most  notably  the 
lack  of  proper  feedback,  and  complex  procedures 
which  require  extensive  learning  while  placing  a 
heavy  demand  upon  operator  memory.  With  a  compre¬ 
hensive  human  factors  program,  many  interface  prob¬ 
lems  can  be  addressed  early  in  the  console/cambat 
system  development  cycle,  to  ensure  that  past  design 
errors  are  not  needlessly  repeated. 

References 

1.  Davenport,  E.W.  and  Bemis,  S.V.  Tracking  Per¬ 

formance;  Trackball  vs.  Digitizer  Tablet. 
Technical  Report  824,  Naval  Ocean  Systems 
Center,  San  Diego  CA,  1982. 

2.  Albert,  A.E.  "The  Effect  of  Graphic  Input 

Devices  on  Performance  in  a  Cursor  Positioning 
Task. "  Proceedings  of  the  Human  Factors 
Society  26th  Annual  Meeting.  1982 ,  54-58 . 

3.  Osga,  G.A.  An  Evaluation  of  Identification 

Performance  for  Raster  Scan  Generated  NTDS 
Symbology.  Systems  Exploration,  Inc.,  Report, 
Contract  No.  N0012 3-80-D-0263 ,  San  Diego  CA, 
June  1982. 

4.  Gould,  3.0.  "Man-Computer  Interfaces  for  Infor¬ 

mational  Systems:  lecture  to  Human  Engineer¬ 
ing  Short  Course,"  in  Huchingson,  R.D.,  New 
Horizons  for  Human  Factors  in  Design,  McGraw- 
Hill,  1981. 


135 


SECTION  m 


Surveillance:  Issues  and  Algorithms 


Preceding  page  blank 


137 


MULTI -SENSOR  FUSION,  COMMUNICATIONS  AND 
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INTRODUCTION 


)  Todays  fusion  problems  are  chiefly  concerned  with 
organizational  and  procedural  issues.  The  technology 
they  employ  is  mostly  available  state-of-the  art. 

The  future  brings  a  new  set  of  concerns  centered 
about  issues  that  are  more  technical  in  nature. 

Future  military  conmand  and  control  and  weapons 
systems  will  likely  be  more  distributed,  more 
automated  and  smarter.  They  will  probably  include  an 
advanced  form  of  information  warfare  where  sensing, 
information  exchange,  janroing,  deception,  and 
misinformation  will  be  capable  of  being  managed  and 
orchestrated  from  a  total  mission  objective 
perspective.  As  a  result,  the  future  fusion  process 
will  be  required  to  handle  and  process  on  orders  of 
magnitude  increase  in  the  volume  and  diversity  of 
input  data,  faster.  It  will  need  to  produce  a  great 
variety  of  information  to  feed  automated  C2  and 
weapons  systems  data  bases  through  more  interactive 
and  responsive  interfaces  than  exist  today.  At  the 
same  time  it  needs  to  analyze  this  data  at  a  deeper 
level  of  understanding  than  ever  before,  scrutinizing 
and  drawing  inferences  and  conclusions  about  ones 
adversaries  underlying  beliefs,  readiness,  intentions 
and  future  actions  from  what  is  often  times  a  suspect 
and  spotty  data  base.  Finally,  these  conclusions  and 
inferences  need  to  be  presented  in  a  clear,  concise, 
honest,  but  convincing  and  timely  tanner.  This  paper 
presents  a  unified  framework  from  whiuh  the  necessary 
information  may  be  fused,  managed  and  presented  to 
support  command  in  such  a  future  information  warfare 
environment  and  discusses  the  associated  technical 
challenges.  This  paper  reviews  various  ongoing 
research  programs  that  are  addressing  these 
challenges. 


INFORMATION  WARFARE  CONTROL  STRATEGY 

When  viewed  in  this  manner,  the  strategy 
illustrated  in  Figure  1  emerges  as  a  reasonable 
information  handling  systems  desicyi  and  control 
scheme.  The  objective  of  this  control  strategy  is  to 
maximize  a  measure  of  the  combined  state  of  knowledge 
of  own  forces  and  state  of  ignorance  of  opponent 
forces  such  as  <  -u)v,  through  judicious  application 
of  the  information  manipulative  functions.  A 
mathematical  expression  that  represents  these 
relationships  has  been  developed  in  references  (2), 
(3),  and  (4).  It  takes  the  form  of  the  following 
differential  equations: 

u»-au-b(  -u)v+c'(  -u) (  -v)+d(  -u) 
v-~a'v-b' (  — v)u+c(  — u) (  -v)+d' (  -v) 
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where 

u  ■  entropy,  or  ignorance,  of  friendly 
v  »  entropy,  oe  igiorance,  of  anemy 
u  *  change  in  entropy  of  friendly 
v  »  change  in  entropy  of  enemy 

a,  b,  e,  d,  a',  b',  c*,  d'  are  positive 
constants. 

Friendly  sensor  measurements,  intercepts  of  enemy 
communications  and  sensor  radiations,  and  human 
intelligence  reports  are  reoeived,  combined,  and 
interpreted  to  form  two  pictures;  friendlies  state  of 
knowledge  of  the  situation  and  the  opponents  state  of 
knowledge.  This  includes  such  information  as  wnac 
units  are  located  where,  their  course,  likely 
destination  and  intentions,  with  indications  of  the 
confidence  associated  with  these  estimates.  Since 
information  warfare  is  a  two-sided  contest,  it  is 
assumed  that  Loth  parties  will  seek  to  select  that 
combination  of  information  manipulative  functions 
which  selves  their  best  interests.  Accordingly, 
friendly  should  select  that  set  of  information 
manipulative  functions  which  results  in  a  maximum 
objective  function  assuming  that  the  opponent  will 
always  select  a  corresponding  set  of  information 
manipulative  functions  designed  to  minimize  the  same 
objective  function.  This  is  a  worst  case  approach. 

It  might  prove  desirable  to  provide  the  decision 
maker  with  other  possible  control  strategies  and 
their  possible  outcomes;  such  as  one  based  upon  a 
best  case  analysis  and  another  based  upon  a  most 
likely  (derived  from  historical  precedence)  enemy 
response. 

Of  course  the  end  objective  is  to  prevail  in 
battle,  maximizing  the  enemies  losses  and  minimizing 
ones  own  losses.  It  is  shewn  in  reference  (2)  that 
the  amount  of  knowledge,  or  information,  a  combatant 
possesses  directly  influences  his  combat 
effectiveness.  Accordingly,  it  is  recommended  that 
prior  to  the  selection  for  execution  of  any  set  of 
information  manipulative  functions,  a  projection  of 
battle  outcome  be  determined  which  includes  the 
likelihood  of  enemy  initiation  of  conflict  and  the 
predicted  losses  and  associated  risks.  This  battle 
outcome  prediction  should  be  presented  along  with 
information  manipulation  control  strategies  so  that  a 
course  of  action  may  be  determined  which  includes 
consideration  of  the  initiation  of  conflict  as  well 
as  pure  information  warfare  actions. 


FUSION  PROCESS 


We  have  discussed  how,  in  the  future  there  will 
be  more  sophisticated  and  capable  systems  providing  a 
greater  diversity  and  quantity  of  more  timely  data 
"or  fusion  and  analysis.  The  enemy,  however,  will  be 
more  sophisticated  and  capable  in  deception, 
camouflage  and  decoy,  consequently,  we  will  be 


required  to  make  use  of  "his  increased  data  just  to 
accurately  monitor  the  enemy  without  being  surprised 
or  fooled.  Moreover,  the  increased  tempo  of  battle 
and  the  needs  of  the  unified  information  warfare 
strategy,  require  us  to  analyze,  process,  and  fuse, 
this  increased  volume  of  data  in  even  shorter  time. 
All  this  highlights  the  need  for  an  improved  more 
responsive  fusion  process. 

More  specifically,  we  are  concerned  with  the 
following  sorts  of  improvements: 

1.  The  percentage  of  data  received  that  is 
profitably  used  (exploitation  of  current  data. 

2.  The  volume  of  data  that  can  be  processed  in 
a  fixed  amount  of  time  (system  capacity) . 

3.  The  ability  to  select  and  prioritize  input 
data  to  be  processed  and/or  disgarded  by  its  value 
added  (selective  processitig) . 

4.  The  speed  at  which  data  can  be  received  and 
processed  (timeliness  &  throughput). 

5.  The  quality  of  the  information  output  • 
(information  credibility,  reduction  of  ambiguity, 
resistance  to  camouflage,  cover  and  deception, 
confidence  in  predictions) . 

6.  The  effective  use  of  the  historical  archival 
data  (exploitation  of  archival,  historical  data) . 

7.  The  behavior  of  the  system  under  data 
overload/saturation  (system  stability) . 

8.  The  manpower  requirements  (number,  skill 
level,  training,  fatigue). 

9.  The  relevance  of  the  processed  information 
to  the  situation  and  the  decision-makers  needs 
(information  tailoring,  responsiveness/relevance, 
anticipation  of  information  needs,  and  data 
reduction). 

10.  Effectiveness  of  the  3ensor  collection 
tasking. 

11.  The  integration  of  analysis  products 
produced  at  different  contributing 
sites/organizations  and  locations  and  the  resolution 
of  collection  tasking  conflicts  and  priorities 
amongst  these  various  locations  (analyst  inter-site 
integration). 

Our  primary  interests  are  with  technologies 
concerned  with  the  processing,  organization, 
retrieval,  presentation  and  dissemination  of  data  and 
information.  Deeper  interests  lie  in  gaining  a 
greater  understanding  of  the  theories  of  human 
cognition  and  reasoning  and  of  plausible  inference 
and  deduction  as  applied  to  the  intelligence  analysis 
process.  At  all  levels  of  this  process  there  is  a 
need  for  better  ways  to  process,  reduce,  su'd 
interpret  data  in  order  to  produce  timely  and 
relevant  information. 


The  tectmical  challenges  vary  with  the  stage  of 
the  intelligence  analysis  process  and  the  form  of  the 
data  at  that  stage.  It  includes  such  areas  as  high 
speed  signal  processing  and  pattern  recognition, 
symbolic  representation  and  amputation,  extremely 
high  density  storage,  image,  speech  and  text 
understanding,  distributed  hypothesis  generation  and 
problem-solving,  planning  and  scheduling. 


The  stages  of  the  intelligence  process  are  of 
course  highly  interactive  in  that  data  does  not  flow 
neatly  left  to  right  as  implied  in  Figure  2,  or  in 
any  fixed  order.  Rather,  an  intelligence  analysis 
problem  gets  worked  simultaneously  at  all  stages  or 
levels  (top-down,  bottom-up,  and  from  the  middle) 
with  tightly  coupled  feedback  loop*  between  all 
stages  or  levels.  Further,  analysis  is  inherently  a 
human-directed  process  where  the  information 
processing  techniques  developed  would  serve  as  an 
assistant,  amplifier,  and  advisor  to  the  intelligence 
analyst,  and  not  as  a  replacement.  Accordingly  they 
should  be  designed  to  allow  an  analyst  to  do  his  job 
better,  consider  more  variables,  and  work  faster  and 
more  efficiently.  Because  of  these  attributes  an 
architecture  is  proposed  that  borrows  from  the 
Hearsay  m  Blackboard  model  (a  federation  of  experts 
who  collaborate  on  solving  a  problem  through  the 
mutual  sharing  and  updating  of  a  canon  hierachically 
structured  multiple  view  of  the  evolving  problem 
state).  Key  components  of  this  architecture  is 
illustrated  in  Figure  3  and  is  explained  below. 


FE-MURE/PAFAHSTTO  EXTRACTOR 

Sensor  data  comas  initially  processed  and  in 
multiple  forms:  formatted  messages  with  features  and 
parameters  identified,  narrative  text,  and  imager/. 

In  seme  cases  even  the  pre-processed  signal  video 
(telemetry,  ELINT  waveforms  and  acoustic  lofargram 
data)  is  available.  It  is  assumed  that  all  ELimr 
data  gets  processed  and  reduced  t  a  message  (fixed 
format  or  narrative)  at  the  coll  don/sensor  site. 
The  raw  imagery  ana  sigial  video  provided  would  be  in 
addition  to  the  preliminary  processing  at  the 
collection  site  and  to  be  used  for  more  in-depth 
analysis.  Thus,  when  data  enters  it  goes  two  places: 
initial  feature  parameter  analysis;  and  data  storage 
and  retrieval.  The  feature  parameter  analysis 
Junction  is  primarily  a  pattern  ^eco^iition 
syntactical  and  statistical  analysis  process  where 
features  and  parameters  such  as  prf,  modulation 
parameters,  harmonic  pairs,  image  ribbons  and  .exture 
measures  get  reduced  to  various  aoabinations  of 
discriminant  categories  and  classes  which  are  then 
sequentially  tested  against  an  existing 
parameter /feature  data  base  constrained  try  associated 
collateral  data  such  as  candidate  object  locations 
and  characteristics,  and  time  of  observations.  The 
objective  of  this  sequential  test  process  is  to 
reduce,  through  elimination,  contradiction  and 
deduction  possible  associations  between  received 
feature  and  parameters  to  as  small  a  set  of  feasible 
candidate  objects,  events  and  event  hierarchies  as 
possible.  The  result  of  this  process  produces  mixed 
results.  Some  discriminants  have  very  strong  unique 
associations  with  a  specific  object  or  event,  others 
can  only  be  ambiguously  or  probabilistically 
associated  with  one  or  more  objects  and  events. 

There  are  several  technologies  that  would  De  useful 
in  this  area:  high  speed  sigial  processors  (e.g. 

VHSIC,  array  processors,  systolic  processors)  would 
be  needed  to  execute  the  compute-intens i ve 
multidimensional  pattern  and  cluster  analysis 
algorithms  required  against  the  potentially  high 
volume  of  input  data.  Natural  language  processing 
techniques  are  needed  to  reduce  the  time  and  manpower 
currently  needed  to  interpret  the  large  volumes  of 
narrative  messages  into  formats  suitable  for  further 
computer  processing.  The  raw  sensor  data  is  assumed 
to  be  first  stored.  It  is  retrieved  and  processed 
only  in  response  to  special  directed  analyses  and 
questions  that  get  identified  further  down  in  tne 
analysis  process.  Discussions  as  to  how  this  data 
gets  retrieved  and  processed  will  be  discussed  in 
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greater  detail  in  the  section  on  rata  Storage, 
Retrieval  and  Inference. 


IWtESPRETER/OCMCEPT  GENERATOR 

the  Intrepreter/Concept  Generator  attempts  to 
logically  relate  the  data  pertinent  to  the  current 
situation/state  description  to  logical  chunks  of 
information  related  to,  or  associated  with,  concepts, 
objects  events  and  event  hierarchies.  This  logical 
association  can  probably  be  ntost  naturally  expressed 
in  the  form  of  frame/script  based  hierarchies.  Each 
object  will  have  slots  representing  attribute  data 
such  as  physical  features,  electronic  order  of 
battle,  current  location,  past  locations,  speed,  and 
associated  events,  event  hierarchies  and  rules  of 
behavior.  Events  will  include  such  items  as  ’missile 
firing’  or  ’report  bade*.  Each  event  chunk  (or 
concept)  will  have  slots  associated  with  such 
attributes  as  measured  observables,  action  taken, 
involved  objects,  location  and  time.  As  mentioned 
earlier,  events  can  in  turn  be  associated  with  other 
events  as  an  identifiable  time  and/or  order  dependent 
sequence  hierarchy;  e.g.  ’coordinated  attack 
operation*,  as  data  is  placed  in  the  data  base  the 
Interpreter  attempts  to  fit  the  data  to  slots  in  the 
various  object  and  event  frames/scripts.  Each  object 
and  event  frame  have  demons  (embedded  procedures) 
that  get  triggered  as  a  function  of  the  specific  data 
clues  or  operations  to  look  for  inconsistencies, 
establish  linkages  or  alerts,  perform  special 
computations  and  take  other  actions.  The 
Interpreter/Ooncept  Generator  maintains  the  common 
hierarchically  structured  multiple  view  of  our 
collective  knowledge  of  the  evolving  problem  state. 

It  should  be  capable  of  linking  together  one  or  more 
parameters,  objects,  end  events  with  an  associated 
confidence  and  propagating  that  confidence  measure  19 
the  connon  multiple  view  hierarchy  and  across  a  chain 
of  logical  deductions. 


HYPOTHESIS  GENERATOR 

The  hypothesis  generator  is  concerned  with 
higher  level  aspects  of  the  current  situation  under 
study  such  as  enemy  strategy  and  intent.  It  attends 
to  associate  or  infer  what  is  the  purpose  or 
motivation  behind  the  observed  sequence  of  events; 
e.g.  exercise,  heightened  state  of  readiness,  a 
planned  deception  some  other  operation.  This  is  one 
of  the  more  difficult  areas  to  deal  with  and  is  one 
of  the  more  highly  intuitive  and  unstructured 
functions.  Accordingly  it  is  envisioned  as  being  the 
most  manually-driven  of  the  functions  discussed  so 
far.  In  the  Feature/Par amater  Extraction  function 
and  in  the  Interpreter/concept  Generator  function, 
the  data,  knowledge  representation  and  inferencing 
structure  Will  likely  be  defined  in  advance  with  the 
process  being  fairly  automatic;  the  analyst 
interacting  on  an  exception  basis. 

In  contrast  to  this  approach,  the  formation  and 
the  selection  of  hypotheses  is  expected  to  be  a 
dynamically! changing  function  assisted  fcy  machine 
prompts,  suggestions  and  aids,  but  closeiy  controlled 
and  managed  by  an  analyst.  For  example,  the 
Hypothesis  Generator  should  be  able  to  generate 
canned  predictions  of  event  sequences  and  object 
inter-relationships  for  several  pre-specified  enemy 
plans  and  strategies  as  a  function  of  the 
environmental  and  situation-specifics.  These  initial 
event  sequence  predictions  are  anticipated  as  merely 
being  a  departure  point  for  the  analyst  who  should 


have  the  capability,  at  a  fairly  high  level  of 
abstraction,  to  modify  and  adjust  the  initial 
hypothesis  to  a  nore  suitable  one  of  his  chocsing. 

The  hypothesis  generator  should  be  capable  of 
reevaluating  the  feasibility  of  two  or  more 
hypotheses  simultaneously.  At  least  one  hypothesis 
should  act  as  the  duty  skeptic,  constantly  checking 
for  the  possibility  of  deception,  looking  for  the 
notable  absence  as  well  as  the  receipt  of  confirming 
data.  This  is  something  the  hunan  finds  very 
difficult  to  do.  psychological  studies  reveal  that  a 
human  typically  forms  a  single  hypothesis,  and  once 
formed,  he  ter.ds  to  look  for  confirming  evidence 
only.  The  incoming  data  and  previously  derived 
evidence  should  be  eontinously  reviewed  for 
indications  of  some  change  to  any  of  the  set  of  all 
possible  hypotheses  and  it  should  alert  the  analyst 
when  any  significant  change  in  the  rankings  of  any  of 
the  hypotheses  occurs;  a  previously  likely  hypothesis 
becomes  suspect;  a  new  hypothesis  other  than  the  ones 
currently  under  serious  consideration  is  suggested; 
or  perhaps,  wtv-n  there  is  not  good  fit  of  any 
hypothesis  to  trie  evidence  at  hand,  in  addition,  the 
hypothesis  generator  should  have  both  pre-set 
heuristics  and  user  -  specificable  and  adjustable 
heuristics  that  examine  the  incoming  evidence  for 
clues  of  confirmation  or  denial  of  currently  favored 
hypotheses,  and  to  assist  in  the  nomination  and 
formulation  of  new  hypotheses.  These  heuristics 
should  make  use  of  such  indicators  as  activity 
patterns  and  statistics,  the  observed  time,  order, 
and  sequence  of  events,  user-specified  interesting 
and  unusual  associations  or  unexpected  trends, 
negative  information,  and  recognition  of  inconsistent 
patterns  and  inferences.  These  heuristics  can 
generally  be  expressed  as  an  unordered  collection  of 
’IF,  THEN*  conditional  statements  or  production 
rules.  Thus,  the  hypothesis  generator  is  envisioned 
as  borrowing  heavily  from  the  expert  system 
production  rule  methodology.  It  should  be  capable-  of 
operating  both  in  an  automatic  data-driven  mode, 
alerting  the  analyst  when  some  change  of  interest  has 
occurred,  as  well  as  being  able  to  be  interrupted, 
tasked  and/or  queried  by  an  analyst  in  more 
goal-directed  search  mode.  These  predicted 
activities  are  transmitted  over  a  cooperative  analyst 
network  foe  confirming  evidence  in  support  of  the 
generated  hypotheses.  In  addition,  the  hypothesis 
generator  exchanges  alerts  over  the  analyst  net 
whenever  unusual  or  highly  interesting  alerts, 
situations,  or  indications  arise.  The  hypothesis 
generator  is  used  to  drive  the  collection 
manager/test  planner  to  prioritize  the  order  in  which 
incoming  data  is  processed  directs  the  browsing 
through  of  historical  data. 


COLLECTION  MANAGER/TEST  PLANNER 

The  collection  manager/test  planner  13  concerned 
with  assisting  the  analyst  in  providing  appropriate 
feecfcack  and  tasking  to  the  collection  sensors  and 
sources.  The  analyst  has  the  opportunity  to  resolve 
inconsistencies,  remove  ambiguities,  and  to  decide 
between  hypotheses  by  appropriately  cuing,  alerting, 
and  tasking  the  collector  for  confirming  or  predicted 
data.  To  provide  this  capability  the  collection 
manager/test  planner  naintains  a  data  base  concerning 
the  various  collection  assets,  their  availability  and 
appropriate  physical  characteristics.  Both  the 
interpreter/concept  generator  and  the  hypothesis 
generatoc  signal  the  collection  manager/test  planner 
for  assistance  whenc-'er  they  encounter  a  conflict  or 
anbiguity  that  requires  resolution  or  whenever  they 
have  reached  a  dead-end  in  their  analysis.  The 
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collection  inanager/test  planner  produces,  based  upon 
the  set  of  initial  hypotheses,  a  decision  tr^e  of 
collection  measurements  from  which  a  best  set  of 
measurements,  is  chosen,  The  best  set  of  roeasu  aments 
is  that  set  which  will  bring  the  system  closest  to 
reducing  the  candidate  hypothesis  to  a  single  one 
under  such  user -specif ied  constraints  as  the  cost  and 
likelihood  of  the  collection  tasking  request  being 
satisfied  and  its  probable  outcome.  These 
conditional  probability  outcomes  are  determined  from 
both  rules  previously  provided  by  che  analyst  or 
?oprox’'met^  '  oy  the  output  of  the 
predictor /simulator,  discussed  next.  The  collection 
manager  test  planner  can  be  asked  to  provide  a  second 
or  third  best  collection  strategy,  or  can  be 
interrupted  and  asked  to  provide  a  best  collection 
strategy  based  upon  a  new  set  of  constraints  supplied 
by  the  analyst.  Once  a  collection  strategy  is 
selected  by  the  analysis,  the  appropriate  tasking 
messages  will  be  automatically  generated  and 
submitted  for  analyst  approval  prior  to  their 
submission  to  the  appropriate  collection  tasking 
organisations.  These  same  collection  tasking 
requests  will  also  be  transmitted  across  the 
inter -analyst  net  to  other  cognizant  analyst 
organizations  working  in  related  areas  for 
coordination  and  for  internal  resolution  of  tasking 
conflicts  and  priorities. 


PREDICTOR/SIHJLATOR 

The  predictor/simulator  is  a  fast  simulation 
model  that  simulates/models  the  motions,  behaviors, 
emissions  and  detections  of  target  objects  and 
sensors  in  a  faster  than  real  time  mode.  It  is  input 
with  high  level,  fairly  abstract  descriptions  that 
are  outputted  by  the  interpreter/concept  generator, 
the  hypothesis  generator  and  the  analyst.  These 
inputs  should  be  able  to  be  viewed  and  modified  by 
the  analyst.  The  output  of  the  predictor/simulator 
would  be  to  move  forward  in  time  and  to  predict  the 
measurements  that  would  result  from  candidate 
sensor/collector  taskings  and  their  associated 
probabilities.  Applicable  technologies  are  the  use 
of  multi-processors  to  achieve  the  desired  simulation 
execution  sp.eds  and  of  rule-based  descriptors  of  the 
objects  and  events  to  allow  for  sufficient  simulation 
flexibility  and  for  good  user  interface.  The  RAND 
Rule  Oriented  Simulation  System  (ROSS)  is  a  candidate 
technology  for  system.  It  has  provisions  to  include 
rules  of  logic  and  behavior  of  the  objects,  models  of 
the  units,  and  sensors  capabilities  and  movements,  as 
well  as  models  of  the  environment. 


HATCH  EP/COM  PARER 

The  matcher/comparer  coinpares  the  incoming 
signal  data  at  both  the  feature/parameter  and  the 
object/concept  level  with  the  outputs  of  the 
predictor /simulator .  Either  it  finds  Gi.  it 

attempts  to  adjust,  within  the  allowable  bounds  of 
the  candidate  hypotheses,  the  inputs  of  the 
predictor/sioulator  with  respect  to  object 
orientation,  motion  and  behavior  in  order  to  achieve 
a  match.  If  a  match  is  achieved,  the  appropriate 
portions  of  the  comaon  multiple  view  representation 
of  the  situation  under  study  is  updated 
appropriately.  If  no  match  is  found,  this  fact  and 
the  amplifying  data  is  reported  to  the  analyst  for 
further  action.  Much  of  this  system  is  anticipated 
as  being  highly  quantitative  and  computation  intense 
VHSIC  technology  and  advanced  pattern  matching 
algorithms  appear  appropriate  to  meeting  this  system 
requirement. 


DATA  STORAGE,  RETRIEVAL  &  INFERENCES 

The  data  base  is  envisioned  as  being  extremely 
large,  ltA4  bytes  or  jreater.  It  will  contain  a 
.mixture  of  data  and  information  types  and  forms 
including:  historical  narrative  data,  imagery  and 
formatted  computer  data  including  hierarchical 
multiple  views  of  the  past,  current  and  predicted 
state  of  objects,  events  and  missions  under 
analysis.  The  size  and  complexity  of  such  a  data 
base  dictates  a  hierarchy  of  advanced  storage  media, 
such  as  magnetic  tape,  optical  disc,  and  random 
access  memory.  Associate  processor  tedwology,  array 
processors,  systolic  arrays,  or  the  like  may  be 
required  for  rapid  content-addressable  data  retrieval 
and  update  ("smart"  memories).  Heavy  use  is 
anticipated  of  the  technologies  of  predicate 
calculus  and  deductive  inferencing  so  that 
information  may  bo  queried  and  retrieved  that  is  not 
explicitly  stated  but  that  may  be  derived  or  deduced 
from  data  that  is  explicitly  stated  and  stored.  Aids 
should  be  provided  to  allow  the  analyst  to 
selectively  browse  through  a  large  text  or  imagery 
data  base.  The  analyst  should  be  able  to  identify 
selected  subjects,  keywords,  sounds,  spatial  features 
and  relationships  of  interest,  and  the  context  under 
which  they  are  of  interest.  The  data  storage, 
retrieval  and  inferencer  should  then  be  able  to 
select  relevant  text,  images,  etc.  for  presentation 
to  the  analyst  with  a  high  confidence  that  all 
relevant  items  and  only  relevant  items  will  be 
retrieved.  If  the  numoer  of  items  selected  for 
retrieval  exceeds  some  reasonable  threshold,  the 
analyst  will  be  informed  of  the  number  of  qualifying 
items.  He  may  then  request  to  see  them  ail 
one-by-one,  or  he  may  choose  to  add  additional 
qualifiers  and  constraints  to  his  request  for  data, 
lb  satisfy  this  requirement,  the  systems  needs  two 
capabilities:  namely,  the  ability  to  use  the 
situation  peciliar  context  to  translate  the  fairly 
high  level  conceptual  information  requests  to  an 
equivalent  set  of  data  indices  and  pattern  feature 
matches  and  then  to  scan  throe  |h  large  volumes  of 
historical  textual  and  imager^  data  for  matches  of 
the  appropriate  combinations  of  keyword,  spatial 
features,  and  patterns  at  a  speed  commensurate  with 
the  transfer  rates  of  the  storage  medium,  vhis 
requires  some  impressive  lower  level  feature  and 
pattern  extraction  processors  interfaced  to  a  fast 
symbolic  processor.  (Reference  (5)) 


DATA  CHECKER  AUDITOR 

Closely  related  to  the  data  storage,  retrieval 
and  inference  is  the  data  checker  and  auditor.  This 
system  acts  as  an  independent  data  auditor  constantly 
checking  the  collective  data  for  internal  consistency 
and  plausibility.  It  should  look  for  contradictions, 
exclusions,  and  negative  evidence.  This  system  is 
also  concerned  witn  data  compaction  and  housekeeping; 
e.g.  the  merging,  purging,  and  forgettirq  of 
data/information.  To  achieve  this  sart  of  capability 
it  is  anticipated  that  this  system  desiyi  will  borrow 
heavily  from  the  ideas  of  Truth  Maintenance  and 
theorem-proving  and  plausible  reasoning. 


PRESENTER 

Trie  presenter  is  envisioned  as  providing  'he 
man-machine  interface,  both  locally  to  assist  uie 
analyst,  and  externally  to  provide  the  user  his 
information  needs  and  to  support  conrainicationc 
between  the  analyst  community.  As  noted  earlier, 
this  inter -analyst  network  is  needed  to  allow  a  more 


142 


effective  integration  and  coordination  of  the 
products  of  the  analysis  community;  e.g.  means  to 
better  distribute  the  intelligence  analysis  load  and 
to  share  the  intelligence  product,  as  well  as  to 
better  coordinate  and  resolve  conflicts  and 
priorities  in  collection  taskings  related  to  thi3 
common  distributed  problem-solving  activity.  The 
presenter  should  rake  use  of  such  advanced  features 
*s  natural  language  and  speech  input  and  output, 
menus,  graphics  and  explanation  facilities  to  explain 
the  rationale  and  evidence  and  line  of  reasoning 
behind  any  intelligence  product.  But  it  should  do 
more  than  this,  it  should  maintain  a  model  of  the 
user  and  how  his  information  needs  vary,  dependent 
upon  the  situation.  It  should  use  this  model  to 
alert  the  user  to  situations  of  interest  as  th-’y 
cccur,  and  to  reduce  and  to  tailor  the  needs  in 
support  of  his  current  decisions  as  opposed  to 
drowning  him  in  an  information  overload  situation. 

The  system  should  anticipate  his  need  for  additional 
further  information,  ltiis  system  is  what  makes  the 
fusion  cen*-  u  appear  responsive,  timely,  and  relevant 
to  the  user  without  overloading  or  overburdening  him 
with  more  data,  at  a  rate  faster  than  he  is  capable 
or  cares  to  absorb  (human  bandwidth  is  approximately 
10-40  Hz).  There  are  techniques  whereby,  once  this 
capability  is  ,n  place  that  it  can  be  used  to  reduce 
the  communications  overload  normally  associated  with 
data  base  query  and  update  (reference  (6)).  Finally, 
multi-level  security  will  have  to  be  addressed.  We 
need  to  develop  processors  fast  enou^i  to  handle  the 
multi-level  security  overhead  without  significant 
performance  degradation  resulting. 
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Summery 


The  increasing  sophistication  of  intelligence  collection  ' 
and  analysis  systems  has  jiven  US  decision  makers  a 
powerful  tool  to  evaluate  the  actions  and  intentions  of  our 
potential  adversaries.  At  ti,e  same  time,  however,  these 
advances  have  in  some  rcspecis  increased  our  susceptability 
to  the  skillful  use  of  cover  and  deception  techniques. 
Throughout  history,  the  potential  success  of  ‘O'  ft  P*' 
operations  has  been  determined  soley  by  the  skill  of  the 
practitioner,  regardless  of  the  sophistication  of  the  intended 
victim  in  conducting  C  4  D  operations.  Today,  we  face  in 
the  Soviet  Union  a  nation  which  has  both  recognized  the 
importance  of  C  &  D  and  has  over  the  years  demonstrated 
an  impressive  capability  to  deceive  and  mislead  both  its 
intended  victims  and  the  US  and  its  allies.  This  paper 
outlines  the  salient  characteristics  of  C  ft  D,  Soviet 
doctrine  and  application  and  some  of  the  techniques  which 
could  be  used  to  uncover  cover  and  defeat  deception, 

The  Operational  Context  for  Cover  and  Deception 
What  is  C  ft  D? 

"Cover"  denies  an  adversary  the  intelligence  data 
needed  to  plan  and  carry  out  operations,  and  it  includes  both 
camouflage  and  avoidance.  Camouflage  can  be  either 
passive,  in  which  case  it  attempts  to  make  the  threatening 
activity  appear  either  benign  or  not  appear  at  all,  or  active, 
in  which  threatening  activity  is  simulated  where  it  does  not, 
in  fact,  exist.  Avoidance  exploits  knowledge  of  the 
adversary's  collection  capabilities  and  operational  use  to 
deny  reconnaissance  opportunity. 

"Deception"  seeks  to  use  both  camouflage  and 
avoidance,  together  with  genuine  but  misleading  activity,  to 
manai>e  an  adversary's  perception  of  events,  capabilities  and 
planned  actions. 

A  skillful  user  of  C  ft  D  seeks  to  provide  an  adversary 
with  lieces  of  information  which  appear  genuine  in  them¬ 
selves,  and  which  fit  a  course  of  action  which  the  adversary 
would  find  reasonable.  In  this,  the  C  ft  D  practitioner 
attempts  to  exploit  the  anchoring  bias  of  the  cognitive 
process  [1],  by  presenting  the  strongest  indications  of  the 
deception  story  first.  If  the  intended  victim  has  already 
formed  an  estimate  of  the  most  likely  course  of  action,  the 
practitioner  need  only  take  those  actions  necessary  to 
provide  substantiating  evidence.  Once  the  victim  has 
focused  on  a  single  most  likely  course  of  action,  receipt  of 
later  information  will  be  evaluated  in  terms  of  whether  or 
not  it  matches  the  current  hypothesis.  The  victim  may  then 
ignore  contradictory  evidence,  fit  ambiguous  evidence  to 
match  the  hypothesis  cs  if  no  ambiguity  existed,  and  accept 
deceptive  activity  with  little  scrutiny. 

Cover  and  Deception  and  the  intelligence  Process 

The  use  of  C  4  D  extends  across  the  conflict  spectrum, 
and  applies  to  other  dimensions  of  military/political 
analysis.  Cover  and  oeception  has  been  successfully  applied 


to  manage  perceptions  during  peacetime,  crisis  and  war. 
Similarly,  C  ft  D  is  a  major  concern  in  successful  application 
of  arms  control  and  to  the  maintaining  the  validity  of  a 
deterrence  policy.  The  operational  context  for  C  ft  D  is 
shown  in  Figure  I. 

The  opposing  decision-maker  selects  both  an  operations 
plan  and  a  C  ft  D  plan,  the  first  to  acheive  his  objectives, 
and  the  second  to  manage  friendly  perceptions  in  such  a  way 
that  counter-action  will  be  misdirected  or  mistimed.  These 
plans  are  executed  as  a  course  of  action  (1),  apects  of  which 
ran  be  observed  if  friendly  collection  assets  are  present  and 
active  when  they  occur  (2).  Threat  assessment  (3)  attempts 
to  correlate  information  collected  with  a  concept  of  how 
opposing  forces  would  be  used  under  a  variety  of 
circumstances,  to  produce  an  evaluation  of  what  the 
opposing  forces  are  trying  to  do.  The  friendly  decision 
maker  then  selects  a  response  (4),  based  on  set  policy  and 
the  capabilities  of  his  own  assets. 

The  principle  objective  of  the  intelligence  process  is 
knowledge  the  opposing  decision-maker's  plan.  Usually,  this 
cannot  be  gained  directly.  Further,  the  raw  data  which  the 
intelligence  system  receives  is  limited  both  by  the 
attributes  of  military  activity  which  are  observable  and  by 
the  time  slices  when  collectors  are  actually  tasked  to 
collect.  Finally,  the  interpretation  of  activity  depends  on 
the  accuracy  of  our  concept  of  the  opponent's  force 
procedures.  Even  without  any  intent  to  deceive  by  an 
opponent,  the  limitations  of  the  intelligence  process  would 
leave  us  with  an  incomplete  and  sometimes  misleading 
picture  jf  his  activities  and  objectives. 

the  threat  assessment  process  is  vulnerable  to  C  ft  D  at 
each  step.  Figure  2  decomposes  the  process,  and  shows  the 
opportunities  for  a  skillful  opponent  to  employ  C  ft  D.  To 
begin  with,  an  adversary  can  control  the  timing  and  type  of 
-activity  by  his  forces  to  manage  our  perception  of  the 
observable  features  of  his  course  of  action.  Assuming  that 
an  analyst  began  with  an  accurate  baseline  of  enemy 
locations  and  activity,  this  type  of  deception  would  cause 
errors  in  threat  situation  monitoring-monitoring  the  current 
state  of  enemy  forces.  At  this  point,  the  opponent  loses 
direct  control  over  his  ability  to  manage  perception,  but 
must  rely  on  the  weaknesses  of  our  intelligence  analysis 
system.  The  opponent's  C  ft  D  plan  attempts  to  orchestrate 
observable  activity  so  that  collection  distortions  and 
interpretation  errors  are  propagated  through  the  higher 
levels  of  intelligence  analysis. 

Indications  analysis,  relying  on  an  incorrect  statement 
of  the  current  situation,  will  misdirect  requests  for 
additional  collection.  Key  activities  will  be  missed,  and 
others  will  be  assessed  as  having  occurred  when  they  have 
only  been  simulated.  The  final  step  in  the  process,  threat 
synthesis,  matches  key  activities  with  the  hypothesized  set 
of  cou'  xes  of  actions.  If  those  key  activities  are  not 
correct,  v  identified,  or  if  the  set  of  courses  of  action  is 
incomplete,  an  incorrect  assessment  of  the  opposing 
decision-maker’s  intentions  will  be  given  to  our  own 
commanders. 


FIGURE  1  OPERATIONAL  CONTEXT  FOR  C&D 


FIGURE  2  THREAT  ASSESSMENT  VULNERABILITIES  TO  C&D 


/ 
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Unfortunately,  many  of  the  developments  in  intelligence 
systems  in  recent  years  have  increased  our  vulnerability  to 
r  4  hat  the  same  time  that  they  have  increased  our  ability 
to  collect  data  and  to  analvze  activity.  Together  with  a 
greatly  increased  ability  to  collect,  we  have  developed 
systems  to  help  the  human  analyst  exploit  that  capability  by 
focusing  attention  on  items  which  aralysts  have  identified 
as  keys.[2  ]  Thus,  each  enemy  course  of  action  can  be  broken 
down  into  indicators-steps  which  must  be  taken  to  realize 
that  action,  indicators  into  key  activities,  activities  into 
observables.  The  result  is  a  system  of  great  power  for 
focusing  attention  on  significant  pieces  of  information  and 
for  leading  to  conclusions  of  intent  based  upon  a  clear  path 
of  reasoning.  The  weakness  of  this  system  is  that  the 
discriminators  at  each  step  become  high  value  targets  for  an 
opponent's  C  &  n  activities,  and  as  we  discuss  in  the  next 
section,  it  is  highly  likely  that  the  Soviets  will  attempt  to 
expioit  this  weakness. 

Soviet  Cover  and  Deception  Doctrine  and  Applications 

Extensive  use  of  cover  and  deception  techniaues  in  the 
tactical  environment  are  basic  tenets  of  Soviet  military 
doctrine.  Natural  Soviet  proclivity  to  secretiveness  coupled 
with  Soviet  experience  and  lessons  learned  over  the  last 
thirty  years  have  convinced  Soviet  leaders  that  cover  and 
deception  are  invaluable  tools  in  tactical  warfare.  These 
attitudes  have  undoubtedly  been  strengthened  by  the 
successful  application  of  0<VD  by  both  Arabs  and  Israelis  in 
the  Middle  East  wars,  and  by  the  British  in  the  Falklands. 

The  commonly  held  Western  appreciations  of  Soviet 
CAD  capabilities  may  already  be  out  of  date.  The  Soviets 
have  repeatedly  demonstrated  their  ability  to  make  the 
improvements  necessary  to  bring  capability  in  other  fields 
up  to  the  demands  of  doctrine.  A  clear  example  of  this  is 
the  comparatively  recent  development  and  mass  deployment 
of  equipment  (such  as  the  KIPOV-class  VSTOL  carrier) 
which  make  the  extension  of  Soviet  offensive  doctrine  into 
the  naval  domain  a  credible  threat  to  NATO.  A 
corresponding  effort  to  increase  the  level  and  sophistication 
of  their  CAD  capabilities  can  therefore  be  hypothesized  as  a 
most  likely  Soviet  course  of  action. 

Soviet  Cover  and  Pecept'on  Poctrine 

Soviet  doctrine  for  Co\er  and  Deception  derives  from 
the  doctrinal  requirement  for  surprise,  one  of  the  basic 
principles  of  what  the  Soviets  call  "operational  art".  Tnese 
basic  principles,  in  the  order  assigned  by  the  Soviet  author 
Savkin  [3],  include: 

•  Mobility/Tempo 

•  Concentration  of  Efforts 

•  Surprise 

•  Activeness  of  Combat 

•  Preservation  of  Effectiveness 

•  Conformity  of  Coals  to  Conditions 

•  Interworking 

Mobilty/Tempo  includes  not  only  speed  of  movement 
but  also  flexibility  ,  such  as  in  changing  the  axis  of  an 
attack.  Concentration  of  efforts  is  more  familiar  to  us  as 
the  principles  of  mass  and  economy  of  force.  By  "Surprise", 
the  Soviets  mean  the  ability  to  force  an  enemy  to  fight  in  a 
situation  unfavorable  to  him-  either  in  a  place  or  time  which 
does  not  allow  him  to  make  full  use  of  his  own  forces. 
"Activeness  of  combat"  states  the  Soviet  desire  to  hold  the 
initiative  ;  this  is  also  the  principle  of  the  offensive. 
"Preservation  of  Effectiveness"  is  tne  Soviet  reaction  to  the 
advent  of  weapons  of  mass  destruction,  which  require  that 
their  forces  avoid  premature  concentration,  and  that  they 
be  equipped  to  survive  in  a  CBR  environment.  "Conformity 
of  Coals  to  Conditions"  demands  that  the  commander  assign 
reasonable  goals  to  his  forces.  Because  commanders  adhere 
to  this  principle,  the  subordinates  can  therefore  be  held 


accountable  for  any  failure.  "Interworking"  is  the  basis  for 
the  Soviet  combined  arms  approach  to  military  operations. 
Interworking  refers  not  only  to  the  coordination  leading  to 
joint  efforts  by  combat  forces,  but  also  to  the  coordination 
of  front  and  reserve  units,  combat  and  support  units,  center 
and  flanks. 

One  of  the  significant  features  of  these  principles  is  the 
degree  to  which  each  is  often  implemented  in  terms  of  the 
others.  A  surprised  enemy,  for  example,  is  given  no  time  to 
recover  if  the  attacking  forces  maintain  the  tempo  of  their 
attack,  and  keep  the  initiative.  Also,  the  dispersion  prior  to 
attack  necessary  to  preserve  combat  effectiveness  makes  it 
more  difficult  for  the  enemy  to  determine  the  time  and 
place  of  an  attack.  Finally,  the  Soviets  achieve  consistency 
in  their  deception  plans  by  a  combined  arms  approach  to 
CAP  operations. 

In  the  discussion  of  methods  to  achieve  surprise,  Savkin 
[4],  mentions  six  general  types: 

•  Lead  the  enemy  astray 

•  Secrecy  or  Preparation 

•  Unexpected  Use  of  Nuclera  Weapons 

•  Deliver  attacks  at  Unexpected  place/time 

•  New  means/methods  of  warfare 

•  Avoid  repetition  of  methods 

The  first  of  these  methods,  leading  the  enemy  astray 
with  regard  to  one's  intended  course  of  action,  is  the 
doctrinal  basis  for  Soviet  deception  operations,  just  as 
secrecy  of  preparation  is  the  bais  for  the  widespread  Soviet 
use  of  cover  and  camouflage  for  offensive  purposes.  The 
last  two  methods  are  useful  in  understanding  how  the 
Soviets  have  been  able  to  continue  to  surprise  their 
opponents  in  intervention  actions  over  the  past  thirty 
years.  In  his  discussion  of  "new  means  and  methods  of 
warfare",  Savkin  explains  that  this  is  usually  acheived  by 
using  existing  means  in  ways  unknown  to  an  enemy,  rather 
than  by  the  introduction  of  a  totally  new  capability.  This, 
together  with  the  avoidance  of  repetition  in  the  methods  of 
operations,  including  deception  operations,  put  our 
intelligence  system  on  notice  that  hypotheses  limited  to 
past  patterns  of  Soviet  actions  not  only  fail  as  aids  to 
detection  of  new  patterns,  but  also  increase  the  probability 
that  the  Soviets  will  exploit  our  tendancy  to  correlate  the 
elements  of  a  new  course  of  action  with  an  old  one. 

Soviet  CAD  Experience 

A  brief  summarization  of  four  Soviet  CAD  operations, 
beginning  with  the  successful  preparations  for  Operation 
Bagration  in  the  summer  of  1944,  [5]  illustrates  how  far  the 
Soviets  have  progressed  in  the  ability  to  lead  their 
opponents  astray.  Knowing  the  German  preoccupation  with 
defending  their  economic  base,  in  this  case  the  Ukraine,  the 
Soviets  covered  their  preparations  for  an  attack  in 
Belorussia  through  denying  the  Germans  any  information 
that  would  contradict  that  hypothesis.  For  this  reason,  the 
Soviets  moved  their  forces  to  their  jump-off  positions  under 
the  cover  of  darkness,  and  spread  the  observable  indicators 
of  impending  attack  over  the  entire  eastern  front-  aerial 
reconnaissance,  bomber  sorties  and  air  defense  were  not 
concentrated  in  the  central  front.  Communications  activity 
by  units  dedicated  to  the  attack  was  kept  to  a  minimum. 
Although  the  Germans  expected  a  summer  offensive,  Soviet 
security  precautions  together  with  German  assessment  that 
the  most  likely  location  for  an  attack  was  the  area  which 
they  most  feared  to  lose,  combined  to  leave  the  Germans 
unprepared  for  the  Soviet  assault.  A  significant  difference 
between  the  Soviet  CAD  operations  and  the  coincident 
US/British  effort  to  deceive  the  Germans  as  to  the  location 
of  the  cross-channel  landing,  however,  was  that  the 
deception  ended  when  the  attack  began.  This  difference  has 
continued  in  Soviet  operations  to  this  day. 
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In  their  preparations  for  intervention  in  Czechoslovakia 
in  1968,  the  Soviets  exploited  both  Western  and  Czech 
preconceptions  of  the  sequence  ot  events  which  would 
precede  such  an  action.  In  1956,  the  Soviets  had  called  the 
Hungarian  leaders  to  Moscow  and  then  invaded.  In  1968,  the 
Soviets  moved  their  forces  to  the  military  districts 
bordering  Czechoslovakia  for  a  long  series  of  exercises, 
summoned  the  Czech  leaders  to  Moscow,  and  did  not 
invade.  Instead,  the  Czechs  were  allowed  to  return  horn 
believing  the  Soviets  would  withhold  action  as  long  as  the 
Czechs  committed  no  excesses  of  liberalization.  Some 
Soviet  units  were  recalled  from  the  border  areas.  Although 
the  Soviets  still  had  sufficient  force  in  place  to  intervene, 
both  Czechs  and  the  West  changed  their  assessment  of 
Soviet  intentions.  Both  were  therefore  unprepared  when  the 
Soviets,  with  token  elements  of  other  Warsaw  Pact  nations, 
invaded  in  August.  It  is  likely  that  at  least  part  of  the 
reason  for  the  timing  of  the  invasion  was  to  coincide  with 
the  summer  vacation  season  in  Europe,  when  many  European 
decision  makers  would  be  on  vacation.  In  addition,  the 
Soviets  used  a  ruse  to  gain  control  of  the  main  airfield 
outside  of  Prague,  sending  the  landing  control  party  on  an 
Aeroflot  flight  dressed  as  tourists.  The  "tourists"  easily 
overpowered  the  Czech  control  tower  personnel  and  then 
proceeded  to  handle  the  landing  of  the  aircraft  carrying  the 
leading  elements  of  the  invasion  force. 

The  Soviet  invasion  of  Afghanistan  [6]  is  a  good 
illustration  of  how  the  Soviets  were  able  to  achieve  the 
same  end-control  of  the  captial  city  airfield  -  while  varying 
the  means.  This  time  the  Soviets  flew  in  the  airfield  control 
party  weeks  before  the  invasion  as  reinforcements  for 
Soviet  units  already  deployed  there  since  September,  thus 
arrousing  no  curiosity.  The  Soviets  then  disarmed  the 
Afghani  armored  forces  by  recalling  the  Afghan  ammunition 
and  anti-tank  guns  for  inventory,  some  of  their  tanks  for 
winterization  and  others  for  the  repair  of  defects.  Although 
western  intelligence  was  not  surprised  -  the  US  had  warned 
the  Soviets  twice  against  intervening  in  Afghanistan-the 
Soviet  choice  of  Christmas  as  the  invasion  date  meant  that 
any  Western  reaction  would  be  delayed  as  the  leaders 
hurried  back  from  their  vacations.  The  surprise  achieved 
against  the  Amin  government  is  reflected  by  the  ease  with 
which  Soviet  divisions  occupied  the  country. 

In  its  initial  stages,  the  Soviet  reaction  to  the  internal 
political  events  in  Poland  were  similar  to  those  of  1968.  [7] 
A  long  series  of  large-scale  military  exercises  were  held  in 
the  western  military  districts,  the  Polish  premier  visited 
Moscow,  and  no  immediate  invasion  occurred.  US  and 
NATO  intelligence  recognized  the  clear  threat  of 
intervention  behind  the  exercises,  and  exerted  diplomatic 
pressure  upon  the  Soviets  to  allow  the  Poles  to  settle  the 
party-union  conflict.  The  nature  and  timing  of  the  coup  in 
December  therefore  caught  both  Solidarity  and  the  US  and 
its  Allies  by  surprise.  Again  the  Soviets  chose  a  time 
(Sunday  morning  during  the  Christmas  season)  when  opposing 
decision  makers  would  be  at  home,  and  an  unexpected 
means-the  Polish  internal  security  forces.  The  intended 
victims  of  the  deception  were  successfully  deceived  even 
though  quite  sensitive  to  the  possibility  of  deception.  The 
exercises  were  widely  perceived  as  a  cover  for  the  Soviets' 
true  coui  e  of  action,  but  this  knowledge  did  not  result  in 
correct  assessment  of  Soviet  intentions. 

In  each  of  the  cases  sketched  above,  a  great  deal  of 
information  indicating  the  true  Soviet  courses  of  action  was 
available.  The  tactica'  observables  of  the  Soviet 
preparations  were  collected.  The  capabilities  of  the  forces 
involved  were  known  as  was  the  general  intent.  In  each 
case,  however,  the  Soviets  succeeded  in  deceiving  their 
victims  as  to  timing  and  method.  This  series  of  successes 
makes  the  development  of  <~*t>  countering  techniques, 
discussed  in  the  next  section,  a  necessary  part  of  any  effort 
to  improve  the  performance  of  our  own  intelligence  system. 


Developing  Techniques  to  Counter  Cover  and  Deception 

Countering  deception  is  a  two-step  process.  The  first  is 
to  identify  the  targets  for  deception,  the  second  is  to 
identify  how  C&D  directed  against  those  targets  can  be 
recognized  and  then  exploited.  These  techniques  must  th_-n 
be  integrated  into  both  sensor  related  improvements  and 
into  the  development  of  expert  systems  and  other  ADP  tools 
for  intelligence  analysis. 

Identifying  C&D  Vulnerabilities 

Identifying  C&D  vulnerabilities  can  benefit  from  the 
great  effort  already  expended  in  the  development  of 
structured  indications  and  warning  systems.  These  systems 
break  down  a  range  of  courses  of  action  into  the  steps 
(indicators)  required  to  achieve  them,  decompose  these 
steps  into  their  key  activities,  and  then  identify  the 
observables  associated  with  each  key  activity.  These 
observables  are  the  high  value  targets  for  C&D  operations, 
since  by  managing  an  opponent's  collection  of  these 
observables,  the  deceiver  exerts  control  on  the  basis  of  the 
victim's  perceptions.  In  order  to  understand  how 
perceptions  can  be  managed,  it  is  therefore  necessary  to 
begin  by  identifying  the  sources  and  methods  used  to  gather 
intelligence  data  (see  Figure  3).  Identifying  the  targets  of 
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FIGURE  3  COLLECTION  DISCIPLINES  AND  SENSORS 


C&D  involves  a  "reverse  engineering"  process  (see 
Figure  4).  We  know  how  individual  collection  sources  and 
disciplines  can  be  exploited  by  an  opponent  in  a  general  way, 
and  we  know  which  disciplines  are  employed  to  collect  given 
observable.  By  matching  collection  means  and  C&D 
methods  pairs  to  the  association  of  collection  means  and 
observables,  we  can  construct  a  C&D  matrix  for  each  key 
activity.  Figure  5  presents  a  sampl-  C&D  means  matrix  for 
one  possible  key  activity-the  deployment  forward  of  a 
technical  unit  (such  as  a  bridging  unit).  Large  scale 
deployment  of  such  units  would  be  required  prior  to  the 
initiation  of  hostilities.  Each  row  in  the  matrix  summarizes 
how  the  observable  within  a  particular  collection  discipline 
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FIGURE  4  DEVELOPING  TECHNIQUES  TO  COUNTER  C&D:  IDENTIFYING  VULNERABILITY 


ACTIVITY  TO  BE  CONCEALED  OR  SIMULATED:  DEPLOY  TECHNICAL  UNIT  FORWARD 


DISCIPLINE 


COMINT 


COVER  &  DECEPTION  MEANS 
•SIMULATE  SOUND  OF  VEHICLES 

•MAINTAIN  NORMAL  LEVELS  OF  COMMUNICATIONS  AT  GARRISON  LOCATIONS  TO  MASK  MOVEMENT  OF  UNITS 
•MAINTAIN  STRICT  COMMUNICATIONS  SILENCE  BY  THE  MOVING  UNITS 


EUNT  •STAGGER  RADAR  CHECK-OUT  BEFORE  DEPLOYMENT  TO  SIMULATE  NORMAL  ACTIVITY 
•MAINTAIN  NORMAL  LEVELS  AND  TYPES  OF  RADAR  ACTIVITY  IN  GARRISON  AREAS 

HUMINT  •RELEASE  COVERING  EXPLANATION  FOR  ACTIVITY  (E.G.,  EXERCISE  ANNOUNCEMENT,*  TROOP  ROTATION) 

■HINT  • DEPLOY  THROUGH  AREAS  WITH  HIGH  LEVELS  OF  BACKGROUND  HEAT  (URBAN  AREAS,  MAJOR  ROADS) 

•DEPLOY  THROUGH  AREAS  WHICH  ABSORB  IR  EMISSIONS 

_  •SIMULATE  NORMAL  GARRISON  ACTIVITY  WITH  NONESSENTIAL  VEHICLES _ 

CPINT  'CAMOUFLAGE  DEPLOYING  VEHICLES  AS  NON-MILITARY 

•SIMULATE  ESSENTIAL  VEHICLES  IN  GARRISON  WITH  NONESSENTIAL  ONES 
•MOVE  AT  NIGHT 


PHOTINT  I ‘SIMULATE  ESSENTIAL  VEHICLES  WITH  DUMMIES  IN  GARRISONS 


VI  SINT 


•DIVERT  FOREIGN  OBSERVERS  FROM  DEPLOYMENT  ROUTES 
•ALLOW  OBSERVERS  TO  SEE  STAGED  ACTIVITIES  ELSEWHERE 


FIGURE  5  EXPANSION  OF  SAMPLE  MEANS  MATRIX 
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could  be  simulated.  The  more  elaborate  the  deception,  the 
greater  number  of  these  methods  would  be  employed,  and 
the  larger  the  number  of  units  simulated. 

Uncovering  C&D  depends  upon  discovery  of 
inconsistency.  The  Soviets  are  likely  to  apply  their  military 
doctrine  to  C&D  operations,  in  that  they  will  strive  to 
achieve  consistency  with  the  least  effort 
necessary  (economy  of  force),  and  with  integration  of  C&D 
operations  in  all  domains  (interworking).  To  counter  C&D 
therefore  means  to  progress  from  line  items  in  the  means 
matrix  to  correlation  of  line  items  in  the  matrix,  and  then 
to  the  higher  levels  of  the  inJica'ions  structure,  as  well  as 
to  other  intelligence  disciplines  (for  example,  the  order  of 
battle).  This  search  lor  inconsistency  takes  place  on  three 
levels,  each  demanding  a  higher  level  of  man  (or  machine) 
intelligence: 

•  '  Single  collection  discipline 

•  Multiple  collection  disciplines 

•  Analytical  procedures  involving  one  or  more 
intelligence  disciplines 

Each  technique,  properly  employed,  should  stretch  the 
deceiver's  web  of  consistency  in  the  observables  harder, 
until  finally  it  gives  way. 

Single  Discipline  Techniques 

Single  discipline  techniques  address  the  weaknesses  in 
the  collection  and  interpretation  chain,  and  can  be  divided 
into  two  types; 

•  bringing  the  target  into  the  field  of  view  and 

•  increasing  target  discrimination. 

Success  in  either  of  these  can  be  achieved  by  improving 
either  the  sensor  capabilities,  the  exploitation  processing,  or 
even  by  alerting  interpreters  to  the  likelihood  of  a 
particular  C&D  method. 

As  an  example  of  the  application  of  these  techniques, 
consider  an  enemy  attempting  to  deploy  SAM  units  forward, 
and  attempting  to  cover  the  radar  emissions.  He  may  try  to 
keep  them  out  of  our  ELINT  field  of  view  by  restricting  the 
time  and  power  of  his  emissions.  They  would  be  brought 
into  the  field  of  view  by  expanding  the  duration  of  coverage, 
or  by  deploying  more  sensitive  sensors.  This  affects  both 
the  enemy's  ability  to  avoid  coverage,  and  his  ability  to 
remain  undetected  in  the  presence  of  a  collector.  The 
enemy  might  also  seek  to  cover  the  mass  deployment  of 
radars  forward  by  testing  them  individually,  so  that  the 
overall  level  of  SAM  radar  activity  in  a  given  area  does  not 
change.  This  can  he  uncovered  by  increasing  the  ability  to 
discriminate  among  the  radar  signals  of  different  units  with 
the  same  types  of  radars. 

Multiple  Discipline  Techniques 

Multiple  discipline  techniques  seek  to  break  down 
inconsistencies  between  two  or  more  observables  associated 
with  the  same  key  activity.  The  first  step  in  applying  these 
techniques  is  to  take  advantage  of  the  means  matrix  to 
identify  the  opportunities  for  multiple  discipline 
correlation.  There  follows  a  determination  of  whether  the 
current  collection  schedules  for  the  sensors  involved  allow 
simultaneous  coverage.  Planning  for  such  coverage 
increases  the  burden  of  activity  required  to  maintain  a 
deception.  Finally,  the  analyst's  ability  to  make  effective 
use  of  multi-source  coverage  requires  that  f  at  the 
interpretation  of  the  collection  be  organized  by  activity. 
For  the  radar  in  the  above  example,  therefore,  the  analyst 
would  be  given  the  PHOT'NT,  ELINT  and  other  coverage  for 
a  particular  area  over  a  specified  time  range.  Multiple 
discipline  techniques  make  cover  difficult  at  the  tactical 
level,  and  make  simulation  extremely  difficult,  since  the 
effort  required  to  simulate  an  activity  in  many  differnt 


disciplines  may  be  greater  than  the  effort  required  by  the 
activity  itself. 

Analytical  Techniques 

Beyond  current  collection  and  exploitation  the 
intelligence  analyst  can  uncover  a  C&D  operation  by 
comparing  current  activity  with  the  knowledge  base  of  an 
opponents  capabilities  and  options.  These  comparisons  are 
intelligence  cross-discipline  consistency  checks,  in  which 
current  intelligence  is  matched  with  basic  intelligence  on 
the  one  hand  and  threat  assessment  on  the  other. 

Basic  intelligence  provides  the  analyst  with  a  reference 
of  what  an  opponent  can  do.  This  includes  the  physical 
capabilities  of  equipment-can  a  mobile  radar  deploy  from  A 
to  B  in  a  given  time?  In  addition,  it  provides  an 
organizational  and  doctrinal  reference  for  current  activity. 
These  are  particularly  useful  in  evaluating  the  activity  of 
the  Soviet  military,  which  has  minimized  organizational 
variations  and  which  does  not  encourage  deviations  from 
standard  operating  procedures.  A  simulated  SAM  battalion, 
for  example,  must  include  the  correct  number  and  relative 
'ocation  of  radars,  laurchers  and  communications 
equipment.  From  the  organizational  and  doctrinal 
standpoint,  it  must  be  co-located  with  one  of  a  limited 
number  of  other  types  of  units.  Discrepancies  in  any  of 
these  factors  becomes  the  basis  for  requests  for  additional 
collection,  and  for  expanding  the  scope  of  the  analytical 
evaluation. 

Once  the  time  and  space  relationships  between 
indicators  and  an  opponent's  likely  courses  of  action  have 
been  established,  these  can  also  help  uncover  a  C&D 
operation,  and  can  also  help  the  analyst  recognize  when  the 
actual  course  of  action  does  not  match  any  of  the 
hypothesis.  One  of  the  major  benefits  of  the  structured 
warning  systems  is  that  the  analyst  can  be  alerted  to  the 
inconsistent  absence  of  activity.  This  absence  could  occur 
under  any  of  the  following  conditions: 

•  the  activity  is  present,  but  is  being  covered 

•  the  activity  not  present,  and  the  other  key  activities 
are  being  staged 

•  the  activity  is  not  present,  and  the  other  activities 
are  part  of  a  course  of  action  outside  the  current 
range  of  hypotheses 

The  analyst  can  identify  which  explanation  applies  by 
increasing  collection  and  exploitation  effort  to  uncover 
activities  if  it  exists.  If  it  is  not  found,  solutions  must  be 
sought  along  both  collection  and  analysis  paths. 

In  the  collection/interpretation  domain,  increased 
effort  would  be  applied  to  determine  if  some  of  the 
observed  activities  are  actually  simulations.  At  the  same 
time,  the  threat  assessment  process  needs  to  reevaluate 
whether  the  absent  activity  is  a  necessary  part  of  a  course 
of  action,  and  whether  a  new  hypothesis  would  be  consistent 
with  the  current  combination  of  active  and  inactive 
indicators.  The  discovery  of  C&D  operations  during  this 
process  has  an  additional  value  in  that  their  existance  is 
itself  an  indication  of  an  opponent's  course  of  action. 

Conclusions 

Current  collection,  exploitation  and  intelligence 
analysis  systems  are  vulnerable  to  cover  and  deception. 
This  vulnerability  has  increased  as  analytical  aids  have 
tended  to  focus  on  the  observables  associated  with  the  key 
steps  for  a  limited  range  of  courses  of  action.  Soviet 
doctrine,  with  its  emphasis  on  surprise  and  continuing 
variations  in  the  means  of  realizing  surprise,  is  ideally 
suited  to  exploit  those  limitations,  and  they  have  been 
uniformly  successful  in  applying  this  doctrine  to  the  present 
day.  Both  current  analysis  aids,  which  are  essentially 
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production  systems,  and  the  expert  systems  now  under 
development  [8] are  basically  similar  to  commercial  systems 
developed  for  medical,  geological  or  engineering 
applications,  p]  For  successful  application  to  military 
intelligence,  the  technology  must  be  "hardened"  to 
withstand  the  skillful  use  of  Cover  and  Deception. 
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I.  Introduction  and  Summary 


A  DSN  Is  a  surveillance  and  tracking  systea 
enploylng  many  geographically  dispersed 
sensor/processor  nodes  connected  by  a  coaputer 
coaounlcatlons  network  and  lapleaented  as  a 
confederacy  of  Identical  autonomous  cooperating 
processea.^.  The  general  properties  of  a  DSN  are  being 
investigate^  through  the  development  and  exercise  of  a 
test-bed  systea.  The  detection  and  tracking  of 
low-flying  aircraft  using  simple  acoustic  sensors  has 
been  selected  as  a  specific  problem  to  be  addressed. 
Each  node  in  Che  test-bed  consists  of  an  array  of 
microphones  as  the  sensor,  several  small  computers  for 
data  processing,  and  a  digital  radio  for  communicating 
Information  between  the  nodes.  Use  of  microphone 
arrays  as  sensors  limits  each  node  to  measuring  target 
azimuths;  nodes  must  exchange  target  azimuth 
information  In  order  to  track  target  positions. 

\ 

A  DSN  need  not  be  as  simple  or  as  homogeneous  ao 
the  test-bed.  It  can  mix  sensor  types  such  as  radar 
and  passive  IR,  putting  one  type  or  both  at  any  node. 
Data  procesaora  need  not  be  colocated  with  all  censors 
and  vice  versa.  Data  processing  capacity  can  vary 
from  node  to  node  and  communications  capacity  can  vary 
from  link  to  link.  The  communications  network  can  mix 
broadcast  radio,  point-to-point  radio,  and  wire.  But 
It  Is  our  opinion  that  all  of  the  significant 
DSN-related  technical  Issues  which  must  be  addressed 
In  developing  such  complex  systems  must  also  be 
addressed  In  developing  the  test-bed  system  and  that 
the  relative  simplicity  of  the  test-bed  helps  focus 
attention  on  those  DSN-related  Issues. 

This  paper  describes  the  acoustic  tracking 
algorithms  currently  used  In  the  MIT  Lincoln 
Laboratory  (MIT/LL)  Distributed  Sensor  Networks  (DSN) 
test-bed.  It  discusses  the  original  motivation  for 
inclusion  of  various  features  In  those  algorithms  and 
the  lessons  learned  about  those  features  through 
experimentation  with  real  and  simulated -data.  Plans 
for  modifications  to  the  detection  and  tracking 
algorithms  are  briefly  sketched. 

Acoustic  propagation  Introduces  delays  In  azimuth 
measurements  which  complicate  the  merger  of  target 
azimuth  information  Into  target  position  estimates. 
At  any  one  measurement  time,  each  sensor  measures 
target  azimuths  that  correspond  to  different  times  In 
the  past,  depending  on  target-to-sensor  range.  A  node 
cannot  estimate  target  position  from  Just  the  most 
recent  measurements  because  they  correspond  to 
different  target  positions.  Each  node  must  maintain  a 
hitory  of  past  sensor  measurements  In  order  to  track 
targets.  Two  algorithms  have  been  developed  for 
estimating  target  positions  from  histories  of  sensor 
xeasurements  produced  by  two  or  more  .nodes. 


One  algorithm  has  the  advantage  that  It 
Introduces  no  more  delay  into  the  target  detection  and 
tracking  process  than  Is  forced  by  acoustic 
propagation.  But  at  one  measurement  time.  It  usually 
estimates  the  positions  for  different  targets  at 
different  times.  Over  a  regular  sequence  of 
measurement  times.  It  usually  estimates  the  position 
of  the  same  target  at  Irregularly  spaced  times. 

The  other  algorithm  forme  estimates  ac  each 
measurement  time  with  a  specified  delay  which  Is 
uniform  for  all  targets.  The  product  of  the  delay  and 
the  speed  of  sound  Is  the  maximum  range  at  which 
targeta  can  be  tracked,  so  that  delay  Is  usually  set 
equal  to  the  sensor  detection  range  divided  by  the 
speed  of  sound.  On  the  surface,  the  generally  larger 
delay  of  the  latter  algorithm  makes  it  appear 
undesirable.  However,  the  Irregularity  of  estimation 
times  produced  by  the  first  algorithm  so  complicates 
further  data  processing  as  to  make  it  even  less 
attractive. 

The  microphone  arrays  used  In  the  test-bed  have  a 
detection  range  on  the  order  of  5  to  10  km  and  the 
radios  have  a  similar  range.  Thus,  no  one  node  can 
measure  a  target's  azimuth  for  very  long.  This 
difficulty  can  be  overcome  If  each  node's  azimuth  data 
Is  distributed  to  every  other  node.  But  to  do  so 
would  be  difficult  In  a  large  DSN,  producing  heavy 
communications  traffic  and  requiring  much  redundant 
data  processing  in  each  node.  The  least  distribution 
practical,  broadcast  of  azimuth  data  only  to  nodes  In 
direct  radio  contact.  Is  done  In  the  test-bed  so  as  to 
hold  down  communications  traffic. 

Only  azimuth  data  Is  now  exchanged  by  the 
test-bed  nodes.  Because  this  data  Is  exchanged  only 
between  nodes  In  direct  radio  contact  and  because  the 
radio  range  Is  comparable  to  the  sensor  range,  each 
node  has  sufficient  Information  to  track  the  positions 
of  targets  within  Its  sensor  coverage.  This 
restriction  Impacts  target  position  tracking  by 
individual  nodes  only  In  the  area  of  track 
Initiation.  As  a  target  moves  through  the  DSN,  each 
node  must  acquire  the  target  as  It  enters  the  node's 
sensor  coverage  and  initiate  a  new  position  track  for 
the  target  despite  the  likely  existence  of  position 
tracks  for  that  target  In  other  nodes.  This 
restriction  also  has  a  major  Impact  on  the 
surveillance  function  of  the  DSN  as  a  whole  because 
Individual  nodes  have  a  myopic  view  of  the  targets 
within  the  coverage  of  all  Che  DSN's  sensors. 

Intuitively,  exchanging  position  tracks  as  well 
as  azimuth  data  between  nodes  In  direct  radio  contact 
should  allow  target  position  tracks  to  be  handed  over 
1  istead  of  reinitiated.  And  more  extensive 
communication  of  position  tracks  should  allow 
formation  of  a  complete  surveillance  picture.  Eut  It 
is  not  yet  clear  how  to  combine.  In  a  statistically 
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valid  manner,  position  tracks  formed  In  different 
nodes  using  sensor  data  vhlch  Is  not  all  exchanged. 
Recent  research  has  examined  related,  simpler 
problems,  and  suggests  that  this  difficulty  is 
surmountable.  However,  the  applicability  of  the 
research  results  needs  further  study. 

II.  MIT/LL  DSN  Test-Bed 

Figure  1  shows  a  simple  block  diagram  of  one  DSN 
test-bed  node  as  well  as  photographs  of  a  microphone 
array  and  a  mobile  node  vehicle  carrying  the 
associated  computers,  radio,  and  their  power  supply. 
Atmospheric  conditions  limit  the  maximum  range  at 
which  typical  targets  can  be  detected  to  between  5  and 
10  km.  The  test-bed  radios,  currently  under 
construction,  will  typically  be  limited  to  a  similar 
range  by  llne-of-sight  propagation.  Thus,  each  node 
communicates  directly  with  those  nodes  having 
overlapping  sensor  coverage  and  only  those  nodes.  The 
radios  are  also  designed  to  measure  the  range  between 
nodes,  allowing  the  test-bed  to  estimate  the  relative 
locations  of  Its  nodes  as  Indicated  in  the  figure. 
Until  the  packet  radios  are  available,  broadcasts  are 
simulated  using  wire  communications. 

Figure  2  expands  the  cracking  block  In  Figure  1 
and  Includes,  for  reference,  the  microphone  array  and 
the  signal  processor.  It  shows  all  of  the  data  flows 
In  the  detection  and  tracking  system,  including  those 
between  nodes.  The  data  flows  are  complex  but,  as 
will  be  seen,  the  complex  flows  are  necessary.  The 
data  from  each  microphone  array  Is  processed  every  two 
seconds  using  adaptive,  nonlinear  filtering1-3  to 
detect  local  maxima  of  the  Incident  sound  power 
(averaged  over  the  preceding  two  seconds)  In  frequency 
and  azimuth  angle.  A  measurement  of  each  detection's 
frequency,  azimuth  angle,  and  average  sound  pressure 
level  Is  produced  in  the  process. 

The  detections  and  measurements  are  made  over  a 
four  to  five  octave  frequency  band.  Such  broad 
bandwidth  allows  detection  of  multiple  harmonic 
emissions  by  a  single  target.  Sensor  data 

conditioning  Is  done  to  reduce  tracking  computational 
load  by  clustering  together  detections  which  could 
plausibly  have  been  caused  by  a  single  target  and  by 
discarding  detections  which  are  relatively  weak  In 
sound  pressure  level.  Each  cluster  Is  characterized 
by  the  average  azimuth  angle  of  the  component 
detections  (weighted  by  sound  pressure  level)  and  by 
the  total  sound  pressure  level.  The  average  azimuth 
angles  are  treated  as  target  measurements  thereafter. 

Acoustic  propagation  variability  makes  Incident 
sound  pressure  level  a  second-order  measure  of  target 
range;  the  sensors  are  primarily  azimuth-only 
measurement  devices.  Thus,  the  full  target  state  is 
not  observable  using  a  single  sensor  and  no  node  can 
accurately  estimate  It  without  Information  from  other 
nodes;  the  nodes  must  share  measurement  Information  to 
fora  ar  estimate  of  the  full  target  state. 

Low-flying  aircraft  can  travel  at  an  appreciable 
fraction  of  the  speed  of  sound  and  even  exceed  It  In 
some  esses.  As  a  result,  the  sensors  measure  target 
azimuth  angle  not  *  the  measurement  times  but  at  the 
times  when  the  measured  sounds  were  emitted.  Target 
azimuth  angles  corresponding  to  present  target 
positions  are  not  observable.  The  lack  of 
observability  prevents  the  direct  application  of 
familiar  tracking  techniques.  Including  the  Kalman 
filter,  which  update  target  state  estimates  using 
current  measurement  data  only.  Further  complications 
are  the  nonlinearity  of  Che  measurement  process  and 


its  affective  noncausality;  a  measurement  by  one 
sensor  may  be  made  later  than  a  measurement  by  another 
sensor  but  may  contain  information  about  the  target 
state  at  an  earlier  time  In  the  target's  frame  of 
reference. 

Target  azimuth  data  must  be  accumulated  over  time 
iu  order  to  form  target  position  tracks.  The  first 
step  In  this  accumulation  Is  target  azimuth  tracking. 
The  azimuth  tracker  in  each  node  is  quite 
conventional,  with  the  exception  that  It  does  not 
estimate  the  unobservable  present  target  azimuth 
angle  but  rather  the  observable  "acoustic"  azimuth 
angle.  The  tracker  uses  a  two  state  (azimuth  angle 
and  azimuth  angle  rate)  a- 8  tracking  algorithm  for 
estimation  and  prediction.  Data  association  la  done 
quite  simply.  A  measurement  Is  associated  with  an 
azimuth  track  and  used  to  update  the  state  estimate  If 
the  azimuth  angle  lies  within  a  window  about  the 
filter's  azimuth  angle  prediction  for  the  measurement 
time.  Only  one  association  la  allowed  per  track  or 
per  detection.  Should  a  measurement  fall  to  associate 
with  any  existing  track,  it  la  used  to  Initiate  a  new 
track.  A  newly  Initiated  track  is  terminated  If  no 
measurement  Is  associated  with  it  at  the  next 
measurement  time;  any  other  track  la  allowed  only  one 
missed  data  association  In  a  ,ow. 

Full  azimuth  creek  state  estimates  are  maintained 
within  each  node  for  the  targets  sensed  locally  but 
only  the  azimuth  component  is  broadcast  to  other 
nodes.  The  azimuth  components  computed  locally  are 
also  added  to  the  node's  azimuth  history  data  base 
along  with  all  azimuth  components  received  from  other 
nodes.  Each  azimuth  track  created  by  a  node  given 
a  unique  (within  the  node)  Identifier  which  is 
broadcast  with  each  azimuth  component.  This  tagging 
allows  azimuth  cosponents  broadcast  at  different  times 
but  based  on  the  same  azimuth  track  to  be  associated. 
Figure  3  sketches  the  organization  of  the  azimuth 
history  data  base.  It  Is  tree-structured,  with  the 
data  sorted  by  originating  node  and  azimuth  track. 

Only  the  data  In  the  azimuth  history  data  base 
Issued  in  further  processing.  The  effect  Is  to  treat 
the  esch  node's  sensor,  signal  processor,  sensor 
conditioning,  and  azimuth  tracker  as  a  virtual  sensor 
with  the  measurement  properties  of  the  whole  chain. 
Each  node  can  be  thought  of  S3  connected  to  a  number 
of  such  virtual  sensors,  one  at  Its  location  and  the 
others  at  the  locations  of  those  nodes  with  which  It 
has  direct  radio  contact.  This  artifice  helps 
modularise  the  tracking  system,  decoupling  details  of 
the  sensor,  signal  processor,  etc.,  from  the  remainder 
of  the  tracking  system. 

The  remainder  of  the  tracking  system  must  form 
estimates  of  target  dynamic  state  (position  and 
velocity)  from  these  virtual  measurements  of  zcouetic 
azimuth.  The  process  Is  done  in  two  steps  at  each 
node : 

1) Eadmate  target  positions  from  acoustic 

azimuth  data  in  the  azimuth  history  data 
base,  and 

2) Estlmate  target  dynamic  states  from  the 

estimated  target  positions. 

Azimuth  histories  can  be  combined  In  two  ways  to 
produce  position  estimates.  The  current  tracking 
system  uses  the  reflection  algorithm4*3.  The 
algorithm  takes  as  inputs  two  azimuth  histories 
originating  from  different  nodes  and  the  locations  of 
those  nodes.  If  the  two  azimuth  histories  could 
plausibly  be  associated  with  a  single  tsrget,  the 
algorithm  estimates  the  position  of  that  target  for  the 
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emission  time  o'  »*■  sound  just  detected  at  the  node 
closer  to  the  tar^  1  position  estimate  produced  by 
the  algorithm  can  be  regarded  as  a  "measurement"  of 
the  target's  position,  delayed  in  availability 
depending  on  the  target's  range  from  the  sensor.  The 
delay  is  the  least  achievable  with  any  position 
estimation  algorithm;  it  equals  the  shortest 
propagation  delay  in  the  current  azimuth  measurement 
data. 

In  each  node,  the  reflection  algorithm  is  applied 
to  all  legitimate  combinations  of  azimuth 
histories. The  resulting  position  estimates  for 
different  targets  are  for  different  times.  The  times 
can  range  from  the  most  recent  measurement  time  (for 
targets  overflying  sensors)  to  30  seconds  earlier  (for 
targets  1C  km.  from  -he  nearest  sensor).  Because 
target-to-node  range  varies  with  time,  so  does  the 
delay  for  a  single  target's  position  estimates  at 
sequential  measurement  times,  even  if  the  estimates 
are  based  on  the  same  pair  of  azimuth  histories  (same 
nodes  and  track  identifiers)  at  sequential  measurement 
times . 

The  variable  times  associated  with  target 
position  estimates  produced  by  the  reflection 
algorithm  complicate  the  process  of  building  position 
tracks  from  position  estimates.  Consider  a  target 
which  is  tracked  simultaneously  by  th*-ee  test-bed 
nodes  and  assume  that  each  node  is  within  radio  range 
of  the  other  two.  Then  each  node  contains  three 
azimuth  histories  corresponding  to  the  target. 
Applying  the  reflection  algorithm  to  each  pair  of 
histories  would  typically  yield  three  different 
poslton  estimates,  each  for  a  different  time.  It  is 
very  difficult  to  recognize  that  all  of  these  position 
estimates  correspond  to  the  same  target,  even  if  the 
azimuth  histories  are  completely  accurate  and  if  the 
reflection  algorithm  introduces  no  inaccuracies. 

For  this  reason,  position  tracks  are  currently 
formed  only  from  position  estimates  based  on 
particular  pairs  of  azimuth  histories,  i.e. ,  for  the 
3ame  pair  of  nodes  and  azimuth  trades  within  those 
nodes.  To  facilitate  this  isolation  of  position 
tracks,  they  are  kept  in  a  tree-structured  data  base 
organized  like  the  azimuth  history  data  base.  In  this 
case,  entries  are  sorted  first  by  the  originating  pair 
of  nodes  and  then  by  the  pair  of  Identifiers  of  the 
originating  azirjth  tracks  (see  Figure  4).  Position 
track  data  base  entries  consists  of  a  time  and 
estimates  of  target  easting,  northing,  east  velocity, 
and  north  velocity  at  that  time,  plus  some  auxiliary 
information. 

A  position  track  is  updated  when  a  new  position 
estimate  is  produced  for  the  pair  of  azimuth  tracks 
histories  upon  which  that  track  is  based.  A  new 
position  track  is  created  if  no  entry  exists  in  the 
data  base  corresponding  to  an  azimuth  track  pair  which 
passed  the  reflection  algorithm's  test.  A  a-B 
tracking  algorithm  is  again  used  for  prediction  and 
estimation.  Azimuth  histories  are  not  completely 
accurate  and  the  reflection  algorithm  can  amplify 
those  Inaccuracies.  The  resulting  position  estimates 
can  be  inaccurate  not  only  in  position  but  also  in 
time.  For  this  reason,  position  estimate  times  are 
smoothed  before  the  a-8  algorithm  is  applied.  The 
auxiliary  Information  in  each  position  track  data  base 
entry  is  the  state  of  the  smoother  for  that  position 
track. 

Neither  target  position  estimates  nor  position 
tracks  are  exchanged  between  nodes  in  the  current 
test-bed.  If  the  nodes  are  all  directly  connected  by 
radio  to  each  other  such  an  exchange  would  be 


unnecessary.  Each  node  would  (in  the  absense  of 
communlcatlors  failures)  have  identical  azimuth 
history  data  bases  and  produce  Identical  target 
position  estimates  and  position  tracks.  But  complete 
connectivity  is  not  practical  in  large  DSNs;  thus, 
azimuth  history  data  bases  typically  differ  from  node 
to  node  and  so  must  the  position  estimates  and 
position  tracks  based  upon  the  azimuth  history  data. 
Each  node  then  has  an  incomplete  picture  of  the 
targets  in  the  DSN's  coverage.  This  point  is 
illustrated  in  the  next  section. 

Exchanging  target  position  estimates  or  position 
tracks  could  provide  nodes  with  additional 
information,  but  consider  an  extreme  example,  of  what 
can  go  wrong  if  target  state  estimates  are  not 
properly  combined: 

A  local  state  estimate  a  if.  creat'd  in  data 
processor  A  from  a  single  measurement  by  one  sensor. 
That  estimate  is  shared  vlth  data  processor  B,  which 
uses  it  to  create  an  identical  local  state  estimate, 
8.  Data  processor  B  shares  its  local  state  estimates 
with  data  processor  A,  including  B.  Data  processor  A 
associates  local  state  estimates  a  and  B  with  the  same 
target  and  combines  them  to  produce  local  ..rate 
estimate  X  with  half  the  variance  of  a  and  B.  This 
process  could  ultimately  lead  to  the  existence  in  data 
processors  A  and  B  of  a  local  state  estimate  u  of 
inlflniteslmal  variance  based  on  a  single  sensor 
measurement. 

Given  the  complexity  of  this  issue,  the  decision  was 
made  to  limit  Che  initial  version  of  the  test-bed  to 
exchanging  only  the  azimuth  components  of  azimuth 
tracks  and  only  between  nodes  in  direct  radio 
contact.  Later,  versions  of  the  test-bed  will 
experiment  with  more  extensive  data  exchanges. 

III.  Tracking  Performance 

All  of  the  results  shown  in  this  section  are  for 
one  field  experiments^*6.  Four  test-bed  nodes  were 
used  to  record  the  passage  of  a  OH-1  helicopter 
west-to-east  along  the  flight  path  illustrated  in 
Figure  5  at  a  ground  speed  of  roughly  45  knots  and  at 
roughly  1000  feet  above  ground  level.  The  letters  F, 
H,  J,  and  L  indicate  the  locations  of  the  four  nodes. 
The  circles  Indicate  checkpoints  used  bv  the 
helicopter  pilot  to  maintain  his  flight  path  and  by 
observers  to  time  the  helicopter's  passage. 

The  runways  overflown  near  checkpoints  3  and  4 
are  ILanscom  Field,  a  busy  military  and  civilian 
airfield.  Under  checkpoint  6  is  Route  128,  a  heavily 
used  eight-lane  superhighway.  Another  major  highway, 
Route  2,  lies  just  south  of  the  area  shown  on  the 
map.  Ncrmal  activity  at  Hanscon  field  provided 
additional  aircraft  in  sensor  coverage  during  the 
experiment  and  rormal  activity  on  Route  128  provided 
an  extended  and  Irregular  interfering  sound  source. 
Construction  vehicles  and  stationary  mechanical 
equipment  operating  near  the  microphone  arrays  also 
provided  acoustic  Interference. 

Figure  6  is  an  azimuth-time  intensity  plot  for 
the  output  of  the  signal  processor  at  node  F.  The 
plot  only  Includes  those  measurements  at  each  time 
having  a  sound  pressure  level  within  10  dB.  of  the 
maximum  value  at  that  time.  The  curves  overlying  the 
measured  values  are  thfe  sequences  of  azimuth 
components  of  the  azimuth  tracks  produced  when 
processing  the  data.  Each  continuoui  curve 

corresponds  to  one  track.  Because  of  breaks  in  the 
measurement  data,  several  distinct  tracks  (having 
distinct  track  identifiers)  are  caused  by  each  sound 
source. 
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The  curve  marked  “TRACK”  is  the  track  of  the  UH-1 
lellcopter,  which  reached  its  point  of  closest 
.pproach  at  roughly  240  seconds  and  was  roughly  600 
leters  from  the  microphone  array  at  that  time.  Two 
■ther  sound  sources  were  tracked:  a  fixed-wing 
tircraft  with  a  varying  azimuth  angle  anj  a  bulldozer 
rlth  an  essentially  constant  azimuth  angle  (due 
louth).  The  “speckles”  on  the  plot  are  signal 
irocesslng  artifacts  or  very  Intermittent  sound 
■ources.  Such  artifacts  or  sources  are  detected 
tniformly  over  time,  but  are  suppressed  by  the  sensor 
lata  conditioning  process  -arly  in  the  plotted  data 
■e cause  of  a  relatively  loud  target  near  the  sensor. 

Figure  7  shows  the  position  tracks  formed  using 
lpdated  azimuth  estimates  calculated  in  node  F  (shown 
li.  the  previous  figure)  and  in  node  H.  Each  cross  Is 
:he  position  component  of  a  newly  updated  position 
:rack.  Thin  lines  connect  sequential  position 
:omponents  of  individual  position  tracks.  The  line 
aarked  "TRACK"  is  again  the  track  of  the  UH-1 
lellcopter.  That  track  is  crossed  by  another,  that  of 
:he  fixed-wing  aircraft.  The  other  tracks  are  of 
Intermittent  sound  sources  or  are  processing 
artifacts,  e.g. ,  "ghost"  tracks  based  on  erroneous 
pairings  of  azimuth  track  histories  which  manage  to 
lass  the  reflection  algorithm  test.  The  short,  dense 
track  radiating  from  node  H  is  a  "ghost*.  Ghost 
tracks  can  often  be  recognized  because  they  trace  out 
physically  unreasonable  trajectories,  beginning  or 
ending  at  a  node  or  involving  unrealistic 
accelerations. 

Recording  of  the  measurement  data  allowed 
experimentation  with  different  connectivities  between 
the  nodes.  Figure  8  shows  the  position  tracks  formed 
in  node  J  when  it  received  azimuth  data  broadcast  only 
by  nodes  H  and  L.  The  figure  includes  distinct  but 
overlapping  tracks  of  the  UH-1  helicopter,  each 
derived  from  a  different  pair  of  azimuth  histories. 
Note  that  node  J  has  an  incomplete  surveillance 
picture.  It  does  not  Include  the  track  of  the 
fixed-wing  aircraft  which  appears  in  Figure  7.  The 
latter  track  was  baaed  in  part  on  azimuth  lnforma'  on 
sensed  at  node  F;  information  unavailable  to  node  J  in 
this  case. 

IV.  Lessons  and  Plans 

Uae  of  the  reflection  algorithm  for  target 
position  estimation  has  the  advantage  of  producing  the 
most  up-to-date  position  estimates  possible.  But  it 
has  the  disadvantage  of  producing  position  estimates 
with  time-varying  delays,  making  it  difficult  to 
recognize  position  estimates  aasociated  with  the  same 
target.  A  consequence  is  the  overlapping  UH-1 
position  tracks  in  Figure  8.  Location  estimate  time 
smoothing  was  required  in  the  position  tracking 
algorithm  as  a  consequence  of  inaccuracies  in  the 
time-varying  delays.  Experiments  using  simulated 
measurement  data  with  no  azimuth  measurement 
inaccuracies  has  revealed  that  this  smooching  process 
by  Itself  produces  some  inaccuracies  in  the  position 
tracks.  These  observations  have  led  us  to  question 
whether  the  advantage  of  the  reflection  algorithm  is 
worth  the  disadvantage. 

Another  position  estimation  algorithm,  the 
possible  position  algorithm5*7*8  takes  the  same 
Inputs  as  the  reflection  algorithm  and  produces 
position  estimates  with  a  fixed  delay.  Targets  at 
ranges  great  enough  for  acoustic  propagation  time  to 
exceed  that  delay  are  Ignored  by  this  algorithm  even 
if  they  are  detected.  So  the  delay  is  usually  chosen 
to  equal  the  propagation  time  for  sound  from  a  target 
at  the  maximum  detection  range.  This  delay  would  be 


30  seconds  for  the  test-bed.  For  a  target  very  near  a 
node,  the  possible  position  algorithm  cause  a  delay 
nearly  as  large  between  detection  of  the  target  by 
that  node's  senror  and  the  creation  of  a  corresponding 
position  estimate.  Lesser  delays  would  occur  for 
targets  further  from  nodes.  Since  targets  are 
typically  first  detected  away  froa  all  nodes  and  are 
well  in  trade  by  the  time  they  are  close  to  any  node, 
this  disadvantage  ia  probably  lees  significant  than  It 
appears  on  the  surface. 

The  availability  of  position  estimates  are 
regular  and  comaon  times  should  allow  significant 
simplification  of  the  position  tracker.  Location 
estimates  plausibly  caused  by  the  same  target  could  be 
clustered  together  In  the  same  manner  as  are  azimuth 
measurements  produced  by  the  signal  processor. 
Entries  in  the  position  track  data  base  would  not  need 
to  be  sorted  by  originating  azimuth  history  pairs  and 
more  u  ual  data  association  could  be  done.  Smoothing 
of  the  position  estimate  timer  would  be  unnecessary, 
eliminating  this  potential  source  of  error. 
Experiments  using  the  possible  position  algorithm  for 
target  position  estimation,  and  the  simplified 
position  track  data  rase  and  tracking  algorithm  it 
allows,  are  planned  to  evaluate  the  trade-offs  between 
timeliness  and  complexity  of  the  twu  algorithms. 

The  limited  sharing  of  information  between  nodes 
in  the  current  test-bed  does  not  Interfere  with 
position  tracking  by  individual  nodes.  But  it 
prevents  nodes  from  "handing  over*  target  position 
tracks  as  the  targets  pass  through  the  DSN* a  sensor 
coverage  and  can  cause  each  node  to  have  a  myopic  view 
of  the  targets  in  the  DSN's  overall  sensor  coverage  as 
illustrated  in  the  previous  section.  Results  of 
recent  research®* *0  into  related  problems  suggest 
that  it  should  be  possible  to  form  a  complete 
surveillance  picture  in  each  data  .processor  if  each 
one  transmits  locally  computed  target  state  estimates 
to  other  data  processors  and  if  each  uses  the  state 
estimates  it  receives  as  well  as  available  raw  sensor 
data  to  update  its  local  target  state  estimates.  But 
the  research  re  ulte  demonstrated  only  asymptotic 
convergence  of  estimates  of  an  invariant  quantity.  We 
must  examine  this  work  carefully  to  extract  clues  as 
to  the  proper  organization  of  algorithms  for  our  more 
complex  situation.  At  the  very  least,  we  would  like 
to  develop  an  ad  hoc  algorithm  for  comlblnlng  state 
estimates  for  the  purposes  of  position  track  hand-over 
that  exhibits  minimal  pathological  behavior. 

The  need  to  demonstrate  a  system  which  performs 
satisfactorily  in  a  realistic  environment  will 
continue  to  drive  the  development  of  the  Lincoln 
Laboratory  DSN  test-bed  tracking  system.  In  parallel 
with  the  development  of  new  tracking  algorithms,  data 
will  be  collected  on  actual  sound  sources  in  stressing 
environments  such  as  crossing  targets  on  known 
trajectories.  Regular  experimentation  with  such  data 
focuses  development  of  the  bystem  on  significant 
problems,  providing  a  starting1  point  for  theoretical 
investigations  and  a  timely  test  of  any  ad  hoc  or 
approximate  aspects  of  the  system. 
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Fig.  1.  The  MIT/LL  DSN  Test-Bed. 
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Fig.  2.  Funct lonal  Description  of  the  MIT/LL  DSN 
Test-Bed  Tracking  System. 


Fig.  3.  Azimuth  History  Data  Base  Organization. 


Fig.  4.  Position  Track  Data  Base  Organization. 


SECONDS 

Fig.  6.  Azimuth  Ar.gle  Measurements  and  Azimuth  Tracks 
for  Node  F. 
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Fig.  7.  Position  Tracks  Formed  at  Nod>  F  from  Azimuth 
Histories  Originating  at  Nodes  F  and  it. 


Fig.  8. 
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Position  Tracks  Formed  at  Nodr  J  from  Azimuth 
Histories  Originating  at  Nodes  H,  J  and  L. 
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ABSTRACT 

»  -  r-\ 

'  Iff  this  paperuwe  considerythe  distributed  eatima- 
tion  pro^ lets  by  a  set  of  agents  connected  by  an  arbi¬ 
trary  communication  network.  The  agents  communicate 
conditional  probabilities  of  the  random  state  over  the 
network.  From  these  conditional  probabilities,  each 
agent  then  tries  to  re-conatruct  the  conditional  proba¬ 
bility  given  all  the  measurements  if  these  were  commun¬ 
icated  instead  of  the  probabilities.  It  is  discovered 
that  in  general  the  agents  have  to  remember  some  of  the 
past  conditional  probabilities  and  may  even  have  to 
request  additional  information.  A  method  for  generat¬ 
ing  the  fusion  algorithm  for  each  agent  based  on  the 
network  structure  is  presented  and  applied  to  some 
examples.  The  results  are  applicable  to  both  dynamic 
and  static  statea. 


1 .  INTRODUCTION 


The  traditional  approach  to  estimation  has  been 
centralized.  Even  though  the  meaaurements  are  gen¬ 
erated  by  a  large  number  of  sensors,  it  is  usually 
assumed  that  they  are  sent  to  s  central  site  where  pro¬ 
cessing  is  carried  out  by  one  agent  (computer).  In 
this  context  centralized  estimation  theory  is  well 
developed  and  has  founu  applications  in  many  real  world 
problems. 
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Reuearch  in  distributed  estimation  hat  progressed 
along  several  directions.  A  team- theoretic  approach 
has  been  taken  by  Barta  [3]  for  decentralized  linear 
estimation  and  by  Tenney  and  Sandcll  14)  for  distri¬ 
buted  detection.  Extensions  of  this  work  in  detection 
have  been  made  by  Teneketzis  IS]  and  Ekchian  and  Tenney 
[6].  Another  approach,  based  on  finding  constrained 
decentralized  filters,  has  been  taken  by  Tacker  and 
Sanders  [7).  The  approach  of  fusion  or  combining  of 
local  estimates  to  recover  the  globally  optimal  esti¬ 
mate  has  been  used  in  [8]  to  [12].  The  linear  problem 
was  considered  by  Speyer  [8],  Chong  [9],  Willaky  et  al. 
[10]  and  Levy  et  al.  [11]  while  Castanon  and  Teneketzis 

[12]  considered  the  nonlinesr  extension.  In.  all  of  the 
above  [81— [11],  the  system  structure  is  hierarchical 
with  no  feedback  communication  or  coordination  from  the 
fusion  agent.  Similar  problems  of  this  type  have  alto 
been  considered  in  the  management  science  literature 

[13] . 

The  network  aspect  in  the  distributed  estimation 
problem  has  been  the  emphasis  in  [14],  [15]  and  dis¬ 
cussed  in  [2].  Borkar  and  Taraiya  [14]  presented 
results  on  the  asymptotic  agreement  among  agents  for 
estimation  while  Tsitaiklis  and  Athens  [15]  considered 
asymptotic  agreement  for  more  general  decision  prob¬ 
lems.  It  has  been  demonstrated  in  [2]  via  an  example 
that  agreement  may  not  be  desirable  since  the  common 
conclusion  may  be  wrong. 


In  recent  years,  there  has  been  growing  interest 
in  distributed  estimation  problems.  In  such  problems, 
the  sensor  measurements  sre  not  all  transmitted  to  s 
central  processor.  Instead,  a  set  of  local  processors, 
which  we  cell  estimation  agents,  sre  present.  The 
agents  are  connected  by  a  communication  network.  Each 
agent  collects  the  measurements  from  a  subset  of  the 
tensors,  performs  some  local  processing,  snd  communi¬ 
cates  the  results  with  other  agents. 

The  advantages  of  such  a  distributed  estimation 
system  sre  many.  It  is  mote  reliable  (or  less  vulner¬ 
able)  since  there  is  not  a  single  central  site  which  is 
responsible  for  the  proper  functioning  of  the  system. 
Communication  is  cheaper  since  only  the  results  of  pro¬ 
cessing,  and  not  the  raw  data,  sre  communicated. 
Furthermore,  each  distributed  agent  bss  the  use  of  the 
processed  data  locally  and  does  not  have  to  wait  tor 
comuinication  from  the  central  processor.  From  a  tech¬ 
nological  point  of  view,  such  distributed  systems  are 
made  possible  by  the  availability  of  cheap  computing 
hardware.  These  advantages  make  distributed  estimation 
systems  extremely  attractive  for  many  military  and 
civilian  applications.  One  such  application  is  the 
distributed  sensor  network  [1],  [2]  for  tracking  and 
surveillance. 


In  this  paper,  we  elaborate  the  results  obtained 
in  [2].  The  philosophy  of  fusion  or  combining  of  local 
conditional  probabilities  to  obtain  the  probability 
conditioned  on  all  available  information  is  again  used. 
However,  arbitrary  network  structures  are  considered 
explicitly.  They  may  be  hierarchical  with  or  without 
feedback  from  the  higher  level  or  fully  distributed. 

The  presentation  is  at  a  fairly  elementary  level  to 
simplify  the  nocation  but  can  be  made  more  sophisti¬ 
cated  if  desired  by  introducing  sigma  fields.  The 
results  may  provide  the  theoretical  basis  for  the 
analysis  snd  design  of  systems  such  as  the  distributed 
sensor  network. 

The  rest  of  this  paper  is  organized  cs  follows. 

In  Section  2,  we  present  the  model  to  be  used  for  dis¬ 
tributed  estimation.  Section  3  describes  the  distri¬ 
buted  estimation  problem.  Section  4  describes  the 
basic  results  for  static  random  states.  A  method  for 
generating  the  fusion  formula  for  arbitrary  networks  is 
given.  The  fusion  algorithms  for  some  examples  are 
also  described.  Section  5  extends  the  basic  results  to 
the  case  of  dynamic  random  states.  Section  6  is  the 
conclusion. 

2.  MODEL  FOR  DISTRIBUTED  ESTIMATION 


"This  work  was  supperted  by  the  Defense  Advanced 
Research  Projects  Agency  under  Contract  MDA-903-81-C- 
0333. 


2.1  State  and  Observation  Models 


We  consider  the  estimation  of  a  random  process 
x(t),  t  (  I  where  T  *  It  , «•)  and  x(t)  €  X.  The  random 
process  x(.)  can  be  static,  deterministic  or  a  general 
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Markov  process.  We  assume  the  statistics  which  specify 
the  random  process  completely  are  known. 

Let  S  be  i  finite  set  of  sensors.  At  a  given  time 
t  in  T,  i  sensor  a  generates  an  output  or  measuretaent  x 
in  the  measurement  space  Zg.  The  triple  (x,t,s)  is 
then  called  a  data  set  and  (t,s)  is  the  data  set  index. 
Let  Z_  be  the  set  of  all  data  sets  and  K  be  the  set  of 
all  data  set  indices.  If  we  assume  that  each  sensor 
can  produce  only  a  finite  number  of  outputs  in  any  fin¬ 
ite  time  interval,  the  sets  Z.  and  K  are  at  most  count¬ 
able.  Furthermore,  for  each  t  £  T,  the  restrictions 
Z.  -  {(x,t',s)  Z  It'  St)  and  K,  »  {(t'.s)  Klt'<t>  are 
both  finite.  We  make  two  additional  assumptions: 

1.  The  sensor  origin  and  time  of  each  data  set  are 
known,  i.e.,  for  any  data  set  (z,t,s)  £  Z_,  t  and  a 
are  known  quantities. 

2.  The  sieasurements  are  all  conditionally  independent 

given  the  state  process,  i.e.,  for  any  finite  sub¬ 
set  {(xj.tj.Sj) .  (xk>tk,sk)}  of  Z_, 

k 

Prob, <  O  lx.  £  dz.  )  |x(t. ) . x(t.)) 

i-1  1  1  1  * 

k 

*  II  Prob. (x.  £  dz.  Ix(t))  (2.1) 

i-1  tit 

With  the  second  assumption,  the  observation  process  can 
be  characterized  completely  by  the  transition  probabil¬ 
ities  (or  orobability  densities)  from  X  to  Z(. 

2.2  Data  Bases 

We  are  interested  in  estimation  of  the  process  by 
a  network  of  agents.  At  any  time  t,  due  to  communica¬ 
tion  constraints,  each  agent  may  not  have  access  to  all 
available  data  sets.  In  general,  an  agent  will  have 
only  a  subset  of  the  available  Z.  at  t,  corresponding 
to  only  a  subset  of  K|  A  data  Base  i  at  time  t  is  a 
subset  of  Z and  a  data  index  base  K  at  time  t  is  a 
subset  of  Kj  .  According  to  this  definition,  Z.  <K ,  ) 
is  the  maximum  data  (index)  base  at  t  and  f(th4empty 
set)  is  the  minimum.  Given  any  data  base 
z  “  K*|  ,*-j  ,  Sj  ) , . . .  ,(z.  ,tk,s.  )),  the  corresponding  data 
index  base  K  -  ((t , , s, ) , . . . ,ltk, skj }  is  found  by  the 
operation  K  -  1^(2. *  where  the  definition  of  I  is  obvi¬ 
ous  and  the  actual  measurements  (z. . z  )  are  found 

by  (Xj . zk)  -  My(Z).  When  Z  -  ft  !(♦)*«  4,  and 

M  («)  -  e  where  6  is  a  symbol  representing  "no  informa¬ 
tion". 

For  each  data  index  base  K  -  {(t,,s.) . (tk,s.)> 

with  corresponding  data  base  11  *  * 

Z  -  {(z,,t,,s,),...,(zk,t  ,s.)>  we  define  the  condi¬ 
tional  probability  P(.TZ)  to  mean  P(.lMv(Z),K)  . 

All  the  definitions  above  can  be  given  more 
rigorously  in  terms  of  sigma  algebras.  This  will  not 
be  attempted  in  this  paper  so  ss  to  simplify  the 
development. 

2.3  Communication  Model 

We  assume  there  is  a  finite  set  N  of  estimation 
agents.  Each  agent  n  has  its  own  set  of  sensors,  i.e., 
a  subset  S(  of  S.  Furthermore,  the  sensor  sets  are 
disjoint  ter  different  agents,  i.e.,  S  n  S  .  »  ♦  tor 
n  4  n' .  Each  agent  n  also  receives  information  from 
other  agents  via  communication.  Communication  among 
agents  is  specified  by  the  knovn  communication  schedule 
C  which  is  a  subset  of  T  x  N  x  N.  (t.n-.n.)  f  C  means 
that  agent  nj  transnits  some  messages  to  agent  n,  at 
time  t.  The  exact  form  of  the  messages  will  be  dis¬ 
cussed  later. 


3.  DISTRIBUTED  ESTIMATION  PROBLEM 
3.1  Information  Graph 

The  distributee  estimation  system  (N,  S,  C)  thus 
consists  of  the  sensor  set  S  and  the  estimation  agent 
set  H  together  with  the  communication  schedules.  Four 
types  of  events  affect  the  change  of  information  in  the 
system.  These  events,  the  times  when  they  occur  and 
the  nodes  (sensors  or  estimation  agents)  which  are 
affected,  are  given  below: 

-  sensor  observation:  K, 

-  reception  of  sensor  data  by  estimation  agent: 

Ut.n)  fix  Nl(t,»)  £  K,  s  £  Sn), 

-  transmission  by  estimation  agent: 

{(t,n)  fix  Nl(t,n,n' )  £  C), 

-  reception  of  transmission  by  estimation  agent: 

{(t,n)  <  T  x  N I (t ,n' ,n)  £  C). 


Consider  a  subset  1  of  T  x  (S  U  H)  which  is  the 
union  of  all  the  sets  defined  above.  Define  an  anti¬ 
symmetric  and  trancitive  binary  relation  (or  partial 
ordering) <  on  1.  such  that 

i.  For  each  (n,t,t')  €  NxTxT,  (t.u)Fl_.  (t',n)(  I 
and  t  <  t'  implies  that  (t,n)  -C  (t',n); 

ii.  (t,s)  £  K,  s  £  S  and  (t,n)  €  L  implies  that 
(t,s)  -<  (t,n);  J 

iii.  (t,n,n‘)  £  C  implies  that  (t.n)  -<  (t,n*). 

This  binary  relation  or  partial  order  on  I  thus 
satisfies  all  the  constraints  associated  with  perfect 
communication  as  defined  by  C  as  well  as  perfect  memory 
at  each  processing  node.  (£,-<)  characterizes  the 
information  flow  in  the  system  and  is  called  the  infor¬ 
mation  graph.  If  all  the  sensor  measurements  (data 
sets)  can  be  communicated  perfectly  through  the  commun¬ 
ication  network,  the  data  base  Z(t,i)  for  each  node 
(t,i)  in  the  graph  (L»-<)  can  be  defined  by  beginning 
with  the  minimal  elements  and  following  the  rules  shown 
below: 

i.  If  (t,i)  is  a  receiving  node, 

Z(t,i)  •  (Z(s, j) l(s, j)  ->  (t,i)>; 

ii.  If  (t,i)  is  a  transmitting  node, 

!Z(a,j)  if  (s,j)  ->  (t,i) 

((x(k) ,k) )  if  (t,i)  »  k  £  K 
4  otherwise. 

In  the  above  (s,j)  ->  (t,i)  means  that  (s,j)  is  an 
immediate  predecessor  of  (t,i)  and  (z(k),k)  f  Z  is  the 
unique  element  whose  second  component  is  k. 

With  this  construction  of  the  data  base,  we  see 
that  (t,i)  -<  (a,j)  if  and  only  if  Z(t,i)  5  Z(s, j). 
Similar  remarks  can  be  made  for  the  data  index  uase 
K(t,i).  Since  there  is  s  natural  direction  (along 
increasing  time)  in  the  graph,  the  arrowheads  on  the 
edges  in  a  pictorial  representation  of  the  graph  can  be 
omitted.  We  would  also  omit  those  edges  which  are  due 
to  transitivity.  From  the  graph,  the  flow  of  informa¬ 
tion  in  the  system  becomes  very  obvious.  A  node  (t,i) 
is  a  parent  of  (s,j)  it  information  flows  from  node  i 
at  time  t  to  node  j  at  time  s.  Note  that  in  the  infor¬ 
mation  graph,  the  receiving  nodes  correspond  to  the  ' 
events  when  estimates  have  to  be  updated  with  the 
arrival  of  new  information.  For  many  applications,  it 
is  sufficient  to  use  a  reduced  information  graph,  which 
is  obtained  by  considering  only  these  receiving  nodes. 

Several  examples  of  distributed  estimation  net¬ 
works  and  their  reduced  inforsution  graphs  are  shown  in 
Figures  1-4  where  the  hollow  circles  and  the  solid  cir¬ 
cles  are  the  communication  reception  and  sensor  data 
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recaption  noder  respectively. 

Baaala  1  (Fution  Without  Coordination) :  Of  the  agent* 
in  M,  agent  1  ia  a  fusion  agent  and  the  rest  are  local 
agenta.  The  local  agents  transmit  to  the  fusion  agent 
after  they  receive  the  data  iron  the  tensors  and  per- 
f ora  local  processing.  Figure  1  shows  the  structure  of 
the  system  (for  three  agents)  and  the  infometion 
graph. 

Eraaole  2  (Fusion  With  Coordination) :  This  is  similar 
to  Example  1  except  that  right  after  fusioo,  agent  1 
communicates  with  the  local  agents  tgsin.  This  struc¬ 
ture  is  also  equivalent  to  a  broadcast  tystea  where  all 
agents  coanunicate  with  each  other.  Figure  2  shows  the 
structure  of  the  ays tea  and  the  information  graph. 


This  lemma  states  that  since  p(xfZ.)  and  p(xtZ2) 
both  include  inforaation  contained  in  the  data  base 
Z.A  Z_,  this  cosmos  inforaation  has  to  be  removed  so 
that  it  does  not  get  double  counted.  Lcomu  1  plays  a 
central  role  in  distributed  estimation  theory  similar 
to  the  usual  Bayes'  rule  in  centralised  estimation 
theory.  When  the  conditional  probabilities  from  multi¬ 
ple  agents  are  combined,  the  fusion  formula  can  be 
obtained  by  repeated  applications  of  Lemma  1.  The  fol¬ 
lowing  gives  the  results  for  three  agents. 


fcemma  2.:  Suppose  Zj,  and  Zj  are  data  bases  at  the 
information  nodes  lj  2  and  3.  ’'Then 


p(xiZ.  U  Z,)  p(*IZ,) 

p(xlZxU  z2  0  z3)  -  c  p(xUz1u  z2)n  z3) 


Example  I  (Cyclic  Communication) :  This  is  the  example 
considered  in  [2].  The  agents  are  arranged  in  a  circle 
so  that  each  agent  transmits  only  to  its  immediste 
neighbor  in  a  cyclic  manner  at  the  specified  communica¬ 
tion  times .  Figure  3  show*  the  example  for  N  ■ 

(1.2,3). 

Example  4  (Multipath  Pattern) :  The  agents  are  arranged 
a*  in  Figure  4.  agent  1  can  only  get  information  from 
agent  4  via  agents  2  and  3. 


ptsiOj/  pvxtdg/  pixiOj/  punjii  t2my 

p(x  IZj  n  Zj)  p(xlZjA  Zj)  pfxlZjOZ^ 


This  lemma  again  has  a  very  intuitive  explanation. 
The  terms  in  the  denominator  conaiat  of  pairwise  redun¬ 
dant  information  to  be  removed.  When  these  are 
removed,  all  information  which  is  common  to 
Z.,  Z-,  and  Z,  is  also  removed.  This  then  haa  to  be 
reatored. 


3.2  Distributed  Fusion  Problem 

The  problem  is  to  compute  p(x(t) |Z(t , i))  for  each 
node  (t,i)  f  i.  in  the  graph  (I_,-<)*  Since  the  condi¬ 
tional  probabilities  or  any  estimates  are  updated  only 
at  the  receiving  nodes  (extrapolation  is  carried  out  at 
the  other  nodes),  we  neeo  only  to  consider  the  computa¬ 
tions  at  the  following  two  types  of  nodes  in  the 
reduced  information  graph:  sensor  data  reception  nodes 
and  coanminication  reception  nodes. 

At  a  sensor  data  reception  node  (t,i),  computation 
of  p(x(t) IZ(t,i))  is  straightforward.  The  standard 
Bayesian  update  tormala  would  suffice.  At  a  communica¬ 
tion  reception  node,  the  objective  is  to  reconstruct 
p(x(t)IZ(t,i))  from  tue  conditional  probabilities 
(p(x(t) IZ(s, j)) I ( s , j)  -<  (t,i)>.  This  problem  is  the 
distributed  furiou  problem:  construction  of  the  condi¬ 
tional  probability  given  all  the  data  sets  which  would 
have  been  communicated  through  the  network  using  only 
the  conditioual  probabilities  available  at  the  prede¬ 
cessor  nodes  in  the  information  graph. 

4.  STATIC  KESULTS 

In  this  section  we  develop  the  main  results  for 
fusion  for  each  agent  i,  assuming  the  random  process  is 
static,  i.e.,  x(t)  ■  x  for  all  t.  Since  the  informa¬ 
tion  from  different  agents  may  overlap,  care  has  to  be 
taken  when  the  conditional  probabilities  from  different 
agents  are  combined.  In  particular,  any  redundant 
information  has  to  be  identified  so  that  it  is  not  used 
more  than  once.  The  following  lemmas  provide  the 
mechanism  for  doing  this.  In  the  following  x  denotes  a 
random  vector  with  prior  probability  p(x>  and  Z_  is  the 
set  of  all  data  sets. 

4.1  Basic  Kcsults 

We  state  toe  following  lemmas  without  proofs,  some 
of  which  can  be  found  in  (16]. 

Lemma  1:  Suppose  Zj  and  Zj  are  data  base*  at  two 
information  nodes  1  and  2.  Then 

p(xlZ.)  p(xlZ-) 

p(xlZ1UZ2)-C  (4.D 

where  C  is  a  normalisation  constant. 


If  all  the  random  elements  involved  are  Gaussian, 
the  lemmas  above  can  be  simplified  so  that  only  the 
conditional  means  and  covariances  are  involved.  Sup¬ 
pose  x  is  Gaussian  with  van  m  and  covariance  P(0). 

Let  x(Y)  and  P(Y)  be  the  mean  and  covariance 
corresponding  to  the  conditional  density  p(x|Y).  Then 
lemma  1  becomes 

Lama  1A: 

P(ZiU  Zj)"1  -  P(Zj)"1  ♦  Pttj)*1  -  P(Z1  A  Z2)_1  (4.3) 

and 

P(ZjU  Z2)_1  S(Z10  z2)  .  Pttjr^ttj)  ♦  P(Z2)_1x(Z2) 

-  P(Z2A  Z2)_1x(Z1A  Z2).  (4.4) 

Lena  2  can  be  simplified  in  a  similar  way.  Lemma 
1A  is  identical  to  that  used  in  [9]  for  deriving  the 
optimal  algorithms  for  combining  estimates  of  linear 

Gaussian  systems. 

We  now  state  the  static  fusion  problem  for  each 
agent  assuming  that  x(t)  -  x  for  all  t.  The  problem  is 
stated  for  the  case  when  messages  are  received  from 
only  one  agent.  But  the  extension  to  multiple  agents 
is  obvious. 

Static  Fusion  Problem 

Suppose  agent  i  receives  a  message  from  agent  j  at 
time  s  in  the  form  of  a  conditional  probability 
p(xlZ(s,j)).  Let  (t,i)  be  the  immediate  predecessor  to 
(s,i)  for  ageot  i.  Agent  i's  data  base  then  changes 
from  Z(t,i)  to  Z(s,i)  ■  Z(t,i)U  Z(r,j)  where  (r,j)  is 
the  immediste  predecessor  to  (s,j)  for  agent  j.  The 
objective  is  to  find  p(xlZ(s,i))  in  terms  of 
p(x|Z(t,i)),  p(x|Z(r, j))  and  possibly  other  conditional 
probabilities  defined  on  the  information  graph,  i.e., 
(p(x|Z(t',i'))|(t',i')<  (r,i)). 

We  do  not  specify  a  prir->  which  conditional  pro¬ 
babilities  are  involved  except  ,  ‘.ave  to  be  condi¬ 
tional  on  some  data  base  Z.  defined  on  the  information 
graph  and  that  they  should  be  available  through  commur- 
ication.  The  following  recursive  algorithm  allows  us 
to  find  the  set  of  needed  conditional  probabilities  and 
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ho*  they  should  be  combined. 

Algorithm  tsi.  Static  State 

The  algorithm  consists  of  repeated  applications  of 
the  following  steps. 


Example  1.  (Fusion  Without  Coordination):  Consider  the 
fusion  time  a.  let  t  be  the  observation  time  immedi¬ 
ately  before  a.  With  the  information  graph  it  ia  easy 
to  see  that  Z(s-l,l)T\Z(t,2)  «Z(t-l,2).  Thus 


p(xlZ(s-l,l)U  Z(t,2))  -  C 


plxjZlt^U.EixlZia^L.UI 

p(x IZ ( t-1, 2 ) ) 


Step  1:  Since  Z(t,i)  and  Z(r,j)  are  subsets  of  Z_. 
Lemma  1  gives 


p(x|Z(s,i))  «  p(xlZ(t,i)U  Z(r,j)) 

.  c  p(alZ(t.i))  p(xlZ(r.i)> 
C  p(*|Z(t,i)n  Z(r,j)) 


(4.5) 


If  Z(t,i)  r\  Z(r,j)  ia  the  data  base  for  some  node  in 
the  inforaation  graph,  i.e.,  Z(t,i)A  Z(r,j)  -  Z(q,k) 
for  some  (q,k)  in  J.  or  if  it  ia  empty,  then  the  algo¬ 
rithm  terminates.  If  not,  Step  2  is  used.  In  terms  of 
the  information  graph  representation  introduced  in  Sec¬ 
tion  5,.  this  step  is  particularly  simple.  We  start 
from  two  information  nodes  (t,i)  and  (r,j)  whose  condi¬ 
tional  probabilities  are  to  be  combined.  Z(t,i)A 
Z(r,j)  corresponds  to  the  information  of  all  those 
nodes  which  are  parents  of  both  (t,i)  and  (r,j). 

Step  2;  Let  Ut.,k,),  (t,,k,),...}  be  the  set  of  com¬ 
mon  predecessors  of A (t , i)  and  (r,j)  in  the  information 
graph.  Then 

Z(t,i) 0  Z(r, j)  •  Zftj.kj) U  Z(t2>k2) U  ...  (4.6) 

Step  1  can  now  be  repeated  with  the  help  of  Lemma  1 
(and  its  multiple  agent  version)  to  exprers  p(xlZ(t,i)0 
Z(r,j))  in  terms  of  the  conditional  probabilities 
p(xl2(t . ,k. )),  i  •  1,  2,...,  and 

p(xlZ(t\k*)n  Z ( t  •  ,k . ) ) ,  i  -  1,  2,...,  j  -  1,  2,  .... 
etc.  The  algorithm  terminates  when  all  the  conditional 
probabilities  are  defined  on  nodes  in  the  information 
graph  or  coincide  with  the  a  priori  diatributions. 

By  applying  this  algorithm,  p(xlZ(t,i)U  Z(r,j)) 
can  be  expressed  in  terms  of  products  and  ratios  of 
conditional  probabilities  defined  on  information  nodes. 
Each  product  corresponds  to  the  fusion  or  combining  of 
information  whereas  each  division  corresponds  to  the 
removal  of  redundant  information.  Hote  that  in  general 
it  it  not  sufficient  to  use  only  the  conditional  proba¬ 
bilities  p(x|Z(t,i))  and  p(xlZ(r,j))  unless  Z(t,i)  and 
Z(r,j)  happen  to  be  disjoint  or  there  is  a  node  (s,k) 
such  that  Z(t,k)  ■  Z(t,i)  A  Z(r,j).  Additional  condi¬ 
tional  probabilities  from  the  past  are  also  needed  so 
that  the  redundant  information  in  Z(t,i)  and  Z(r,j)  can 
be  identified  and  resioved. 


We  have  thus  solved  the  fusion  problem  for  each 
agent  in  a  distributed  estimation  network.  This  algo¬ 
rithm  also  provides  us  with  the  set  of  conditional  pro¬ 
babilities  which  needs  to  be  stored  at  each  agent  plus 
the  additional  set  of  conditional  probabilities  which 
needs  to  be  communicated . 

When  the  random  eleuenta  involved  are  all  Gaus- 
aian,  the  sufficient  statistics  for  the  conditional 
probabilities  become  the  conditional  means  and  covari¬ 
ances.  With  the  help  of  Lemma  IA,  we  can  again  apply 
the  algorithm.  Instead  of  multiplication  and  division 
of  probabilities,  however,  we  now  have  operations 
involving  conditional  means  and  covariances.  The 
results  are  straightforward  and  will  noc  be  presented 
here. 


4.2  Static  Examples 


By  a  recursive  argument,  we  can  show  that 

p(xlZ(s,l) )  -  C  n  P(*lz<«-M>).  (4.7) 


Each  term  in  the  product  contains  the  new  informa¬ 
tion  contained  in  the  new  measurement  z(t,i)  of  agent 
i.  All  other  information  is  already  known  to  agent  1. 
The  fusion  problems  of  the  other  agents  are  similar. 

Examule  2  (Fusion  with  Coordination) :  This  is 
equivalent  to  broadcast  communication.  From  the  infor¬ 
mation  graph,  the  algorithm  gives  for  j 

p(x|Z(s, j))  -  C  n  p(xlZ(s-l, j)).  (4.8) 

Eacn  term  in  the  product  is  the  new  information  con¬ 
tained  in  measurement  z(t,i). 


Example  J.  (Cyclic  Communication) :  The  algorithm  gives 
for  general  i  -  1,2,3 


p(x!Z(s,i)) 


c  p(xjz(t.D) 

L  p(xlZ(t-2,i)) 


p(xlZ(t-l, li+1) )) 


p(xlZ(a-3,i)) 


(4.9) 


where  lil  is  i  modulo  3. 


Thus,  in  addition  to  the  most  current  conditional 
probability  p(x|Z(t,l)),  agent  1  has  to  remember  three 
other  probabilities.  Note  that  p(x IZ (t-1, 2) )  iB  avail¬ 
able  to  agent  1  from  earlier  communications.  This 
indicates  that  in  a  distributed  estimation  network, 
knowing  the  most  recent  estimate  is  frequently  not  suf¬ 
ficient  if  one  wants  to  recover  the  globally  optimal 
estimate.  In  fact,  it  has  been  shown  via  simulation  in 
12}  that  if  a  suboptimal  rule  of  combining  estimates  is 
used,  such  as 

p(x|Z(t,l)U  Z(t,2))  a  C  p(xlz(t,l))  p(xlZ(t,2))  (4.10) 


for  agent  1  sod  similar  rules  for  agents  2  and  3,  the 
agents  agree  asymptotically.  This  is  consistent  with 
the  results  on  asymptotic  agreement  in  distributed 
estimation  as  given  in  (14).  However,  the  agents  can 
converge  to  the  wrong  estimate  as  demonstrated  in  [2]. 
Thus,  although  optimal  fusion  algorithms  are  iu  general 
more  complicated,  requiring  more  memory  and  more  compu¬ 
tation,  they  are  nonetheless  necessary  it  good  perfor¬ 
mance  is  needed.  A  suboptimal  algorithm  has  also  been 
tested  in  [2j  and  shown  to  hove  some  nice  properties. 


Example  4  (Multipath  Pattern) :  The  fusion  problems  of 
agents  2  and  3  are  straightforward.  For  agent  1, 
repeated  use  of  the  algorithm  (with  the  help  of  the 
information  graph  in  Figure  4)  gives 


p(x|Z(s,D)  ■  C 


p(x|zu-i,3) 


p(x  IZ  (t-2 ,4) ) 
n(x IZ (t-1 ,4) ) 


p(xlZ(t,l)) 


) 


(4  11) 


In  addition  to  the  conditional  probabilities  from 
agents  2  and  3,  conditional  probabilities  by  agent  4 
are  also  needed.  These  would  have  to  be  relayed  by 
agents  2  or  3. 


In  the  following  we  assume  the  measurements  are  Io  the  above  examples,  general  fusion  formulas  are 

•ad:  at  times  {...,  t-1,  tel,...}  and  meaaagea  are  given.  If  the  random  vectors  are  all  Gaussian,  these 

received  at  times  {...,  s-1,  s+1,...}  with  s-1  <  t  <  s.  formulas  can  be  simplified  using  Lemma  IA. 
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5.  DYNAMIC  IBSULTS 

Aiime  now  that  x(.)  is  a  Harkov  process.  The 
{uaion  problem  tor  each  agent  will  now  be  considered. 
Since  the  data  sets  are  no  longer  conditionally 
independent  given  x(t),  one  immediate  question  is  the 
choice  ot  an  appropriate  “state"  whose  conditional  pro¬ 
babilities  would  be  computes,  transmitted  and  combined 
by  the  various  agents.  Let  T(t,i)  be 

T(t, i)  -  <t'  £  Tl(t'.i')  (  K(t , i) } ,  (5.1) 

and 


(x(t  ))t.  f  T(ti) 


for  each  information  node  (t,i)  where  fusion  is  to  be 
performed.  Then  the  problem  is  effectively  reduced  to 
a  static  problem  of  the  type  considered  in  Section  4. 
Using  the  independence  assumptions  on  the  measurements 
in  the  data  base  given  y,  the  algorithm  in  Section  4 
can  be  applied.  However,  this  means  that  the  condi¬ 
tional  probability  of  a  high  dimensional  random  vector 
y  would  have  to  be  stored  and  transmitted.  From  an 
implementations!  point  of  view,  this  may  not  be  feasi¬ 
ble. 


For  deterministic  random  processes,  which  can  be 
characterized  by  the  state  at  one  given  time,  an  obvi¬ 
ous  choice  is  to  estimate  x(t  )  where  t  is  the  minimum 
in  the  set  T.  Again,  due  to  8he  Markov°property ,  the 
conditional  independence  assumption  is  satisfied  and 
the  algorithm  -an  be  used.  However,  if  there  are  sub¬ 
stantial  changes  in  the  process,  x(tQ)  may  not  be  the 
state  of  interest.  In  this  section,  we  characterize 
the  more  current  states  whose  conditional  probabilities 
ought  to  be  transmitted  and  combined. 

The  following  generalization  of  Lemma  1  is  needed. 


Then  for  a  deterministic  random  process  x(.),  equation 
(5.5)  holds  with  x  replaced  by  x(t»),  where 

t*  *  minit'Kt.s)  <  L(t,i)-{(t",s")}}  (5.6) 

and  (t.",s“)is  the  minimal  elsswnt  in  L(t,i). 

The  proof  is  straightforward  and  is  based  on  the 
algorithm  ot  Section  4  and  Leaa  3.  This  theorem 
states  that  for  random  processes,  in  general  the  fil¬ 
tered  estimate  represented  by  the  conditional  probabil¬ 
ities  p(x(t ) |2(t , i) )  may  not  be  adequate  tor  optimal 
fusion  at  time  t.  Sometimes  the  agents  need  to  have 
the  conditional  probabilities  of  the  states  at  some 
earlier  tunes.  Thus,  smoothed  estimates  are  frequently 
needed.  From  this,  the  estimates  of  the  current  states 
can  be  obtained  easily  by  extrapolation.  When  this 
theorem  is  applied  to  the  examples  in  Section  4,  we 
obtain  the  following  results. 


Example  1,  (Fusion  without  Coordination):  In  the  fusion 
equation  (4.7),  the  state  to  be  estimated  is  x(t). 

This  is  consistent  with  the  results  in  [8]— [ 12 J .  As  a 
variation  of  this,  consider  a  periodic  fusion  situation 
where  the  local  agents  acquire  measurements  at  a  higher 
rate  than  they  communicate  with  the  fusion  agent  (Fig¬ 
ure  5).  Specifically,  let  the  new  fusion  time  set  tor 
agent  1  be  {...,a-M,s,a+M,..,}  where  M  is  the  number  of 
time  units  between  communication.  Then  application  of 
the  theorem  yields 


p(x(t-M+l) |Z (a, 1 ) ) 


p(x(t-Msl )  IZ  (t-H, i) ) 


p(x(t-M*l)IZ(s-M,l))  (5.7) 


Thus,  the  state  ot  interest  is  now  x(t-H-l),  and  each 
term  in  the  product  containa  the  new  information  ot 
agent  i  about  this  state. 


Lemma  J:  Consider  a  random  vector  y  and  data  bases  Example  2  (Fusion  with  Coordination):  In  equation 

Zj  and  Zj  defined  on  the  information  graph.  Suppose  (4.8),  the  state  is  x(t). 

p(Z.-Z-,Z,-Z,  IZ.  A  Z_,y)  Example  3  (Cyclic  Communication):  Inequation  (4.4), 

the  state  is  x(t-Z).  Thus,  extrapolation  is  needed  if 
“  plZj-ZjIy.Zj  FI  Z2)  p(Zj-Zj  ly.Zj  ft  Zj).  (5.3)  the  estimate  of  x(t)  is  needed. 


Then 


pfylZjU  Z2)  -  C 


pfylZ^  p(ylZ2) 
pfylZjH  Z2> 


Example  4  (Multi path  Pattern) :  In  equation  (4.11),  the 
state  is  x(t-l).  Thus,  extrapolation  is  again  needed. 

6.  CONCLUSION 


where  C  is  a  normalization  constant  and  A-S  denotes  the 
difference  ot  the  sets  A  and  B. 

Lemma  3  states  that  evenlthough  the  individual 
measurements  in  Z,  do  not  satidty  the  conditional 
independent  assumptions  given  V,  Equation  (5.4)  (which 
is  the  same  as  (4.1))  is  still!  valid  provided  the 
private  data  bases  Z^-Z  —  Zj-Z^  are  conditionally 
independent  given  the  state  y  and  the  common  informa¬ 
tion  Z  j  A  Zj.  1 

We  can  now  state  the  following  theorem  which 
characterizes  the  state  vector  which  should  be 
estimated  tor  deterministic  dynamic  random  processes. 

Theorem:  Consider  the  fusion  problem  for  the  informa¬ 
tion  node  (t,i)  assuming  a  deterministic  random  process 
x.  It  the  algorithm  of  Section  4  yields  the  fusion 
formula 

p(x IZ ( t , i) )  -  F(p(x|Z(t',i'));(t',i')  (  L(t,i))  (5.5) 

where  F  is  a  function  consisting  of  products  and  ratios 
of  p(xlZ(t'  ,i'  ))'s  in  the  set  L(t,i),  and  L(t,i)  is  a 
subset  of  the  predecessor  information  nodes  of  (t,i). 


We  have  presented  a  formalism  for  the  distributed 
estimation  problem.  Using  this  formalism,  the  optimal 
fusion  algorithm  for  each  agent  in  the  network  has  been 
developed  for  arbitrary  network  structures.  Both 
results  for  static  and  deterministic  dynamic  random 
states  have  been  described,  and  illustrated  with  exam¬ 
ples.  The  results  have  been  presented  for  very  general 
state  and  observation  models.  Special  cases  such  as 
linear  models  with  Gaussian  noises  can  be  considered. 

An  interesting  special  case  tor  distributed  multitarget 
tracking  and  classification  has  also  been  investigated 
and  briefly  reported  in  (2l.  The  details  will  appear 
elsewhere . 
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Figure  2  Fusion  With  Coordination 


Figure  3  Cyclic  Communication 
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Figure  4  Multipath  Pattern 


Figure  5  Periodic  Fusion 
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1.  Introduction 

The  problem  of  multiobject  tracking  has  been  stud¬ 
ied  extensively  over  the  past  few  years.  Good  summa¬ 
ries  of  these  approaches  are  available  In  the  survey 
papers  by  Reid  [1]  and  Bar-Shalom  (2|.  There  are  two 
drawbacks  that  are  common  to  all  the  algorithms  that 
have  been  developed  and  studied  so  far.  Firstly,  the 
relationship  of  the  algorithm  to  the  optimal  (Bayesian) 
algorithm  cannot  be  clearly  seen  due  to  the  ad  hoc  and 
sometimes  arbitrary  approximations  that  are  Introduced 
Into  the  algorithm.  Secondly,  none  of  the  algorithms 
explicitly  model  or  account  for  maneuvering  targets. 
Several  algorithms  1 3 j— 1 7  J  have  studied  maneuver  detec¬ 
tion  and  estimation  for  single  targets.  However,  the 
contradictory  requirements  imposed  by  maneuvers  and 
clutter  for  the  selection  of  gate  size  has  prevented 
any  of  these  suboptlmal  algorithms  from  being  extended 
.  __  to  the  multlobject  tracking  problem. 

^  y*  '  1  * '  v 

7 — Itt^thls  paper  -we-  develop-  an  algorlthmyhat  over  \  J ' 

comes  both  these  drawbacks.  We  formulate1' the  multToIr- . 

J  ec  tracking  problem  within  the  framework  of  a  hybrid 
state  estimation  problem.  This  permits  the  construc¬ 
tion  of  the  optimal  s'lutlon  to  the  multlobject  track¬ 
ing  problem.  However,  due  to  the  exponentially  growing 
storage  and  computational  needs  of  the  algorithm  with 
time,  some  form  of  suboptlmal  approximations  have  to 
be  made.  Since  these  approximations  are  made  within 
the  framework  of  the  optimal  algorithm,  the  nature  and 
trade-offs  associated  with  the  approximations  can  be 
examined.  Furthermore,  optimal  and  suboptlmal  algo¬ 
rithms  for  maneuver  detection  and  estimation  can  also 
be  Incorporated  within  this  framework.  _ ^ 

The  paper  Is  organized  as  follows ^  In  Section  2 
n  formulateithe  problem  of  multlobject  tracking  of 
maneuvering  targets  within  the  framework  of  the  hybrid 
state  estimation  problem.  The  computer  implementation 
of  the  optimal  solution  is  discussed  In  Section  3. 
Suboptlmal  features  that  reduce  the  computational  re- 
^^pilrements  are  discussed  in  Section  A.  Finally,  In 

Section  5,  we  provider, some  simulation*  results.  <£ - - 

\  .  i'  ' 

\ 

2.  Problem  Formulation  and  Optimal  Solution 


A  multlobject  tracking  algorithm  Involves  two 
basic  functions  -  association  of  measurements  with 


*Thls  work  waa  supported  by  the  Department  of  the  Navy, 
Naval  Air  Systems  Command,  under  contract  number 
N00019-C-82-0A56. 


postulated  targets  and  utilization  of  the  measurements 
to  track  the  targets.  The  association  of  the  measure¬ 
ments  with  postulated  targets  can  be  viewed  as  a 
discrete-valued  state  estimation  problem.  Subsequent 
to  making  these  associations,  tracking  the  targets 
corresponds  to  a  continuous-valued  state  estimation 
problem.  As 

The  model  for  a  hybrid  system  can  be  represented 


x(k+l)  -  A(k,  q(k)]  x(k)  +  £(k,  q(k))  (1) 


-  C(k,  q(k) }  x(k)  ♦  _n(k,  q(k>) 


x  represents  an  n-dlmenslonal  concinuoua- 

—  valued  state  vector. 

1  > 

£  represents  an  m-dlmenslonal  measurement 
vector. 

A  Is  an  nxn  matrix  representing  the  tran¬ 
sition  dynamics. 

C  Is  an  m*n  matrix  representing  the  mea¬ 
surement  process. 

£  Is  an  n-dlmenslonal  white-noise  process 
representing  model  uncertainty.  - 

jn  is  an  m-dlmenslonal  white-noise  process 
representing  measurement  uncertainty. 

q  represents  a  discrete-valued  stochastic 
process  which  take  on  l  values. 

k  represents  the  time  index. 

Notice  that  the  matrices  A  and  C  and  the  charac¬ 
teristics  of  the  noise  process  are  controlled  by  the 
process  q.  Their  dimensions  n  and  m  are.  In  general, 
dependent  on  time.  The  discrete-valued  process  is 
assumed  to  be  a  Markov  process  with  an  1  which  is  also 
time-dependent . 

As  mentioned  earlier,  the  key  features  of  a  multi¬ 
object  tracking  algorithm  in  a  track-while-scan  system 
can  be  captured  by  this  model.  For  example,  x  repre¬ 
sents  the  composite  rtate  of  all  postulated  targets  and 
£  represents  the  composite  vector  of  all  measurements 
at  scan  k.  Further,  the  value  of  q(k)  specifies  the 


* 


association  of  measurements  with  postulated  nodels 
through  the  matrices  A  and  C.  We  will  elaborate  this 
in  Section  3. 

An  optimal  estimate  of  the  state  of  the  hybrid 
system,  represented  by  Eq,  1  using  the  measurements  in 
Eq.  2,  can  he  obtained  by  computing  the  joint  posterior 
probability  density 

p[x(k),  qk[zkj  (3) 


where 


zk  -  (z(l),  z(2),...,z(k)) 


and 


qk  -  (q(l),  q(2) . ,q(k) ) 


functions  In  Eq.  6  can  be  obtained  by  computing  their 
sufficient  statistics  using  a  bank  of  Kalman  filters. 
Now  the  hypothesis  qk  represents  a  particular  sequence 
of  target  and  measurement  associations.  If  a  system¬ 
atic  method  Is  used  to  compute  these  probabilities  In 
Eq.  4,  then  the  sufficient  statistics  for  constructing 
the  posterior  probability  of  jc(k)  will  be  available. 
Using  this,  either  a  Maximum-a-Posteriori  (MAP)  or  a 
Mlnimum-Variance-of-Error  estimator  can  be  constructed. 
In  the  next  section  vs  discuss  how  the  probabilities  In 
Eq.  4  can  be  reconstructed  for  the  multlobject  tracking 
problem. 

3.  Implementation  of  the  Optimal  Algorithm 

As  pointed  out  earlier,  the  essential  difficulty 
In  developing  the  optimal  algorithm  Ilea  in  construct¬ 
ing  all  the  possible  combinations  of  targets  with  mea¬ 
surements  and  then  computing  their  probabilities.  The 
combinatorial  problem  Is  even  more  aggravating  because 
of  changes  In 

1.  number  of  targets  due  to  births 
and  deaths, 


For  most  hybrid  systems  of  practical  Importance,  the 
transitions  of  the  discrete-valued  state  are  Indepen¬ 
dent  of  jx(k).”  In  such  cases  the  probability  of  a 
particular  sequence  qk  (which  we  will  refer  to  as  a 
hypothesis)  can  be  computed  recursively  as 


2.  dynamic  models  of  targets  due  to 
maneuvers ,  and 

3.  measurement  characteristics  due  to 
clutter  or  missed  measurements. 


p[<tklik]  “  —  [p(*.C*t> |qk.  i11"1)  p(q(k)|qk~l,  £k-1) 
Ck 

p(qk_1l£k"1)] 


where 

.Cfc  represents  the  summation  of  the  numerator 
over  all  possible  qk. 


Two  approaches  recommended  In  the  past  (9]  have 
attempted  to  construct  the  hypotheses  in  the  form  of 
a  matrix.  In  one  of  these  approaches,  referred  to  as 
the  target  oriented  approach'  ( 1 1 ,  the  postulated 
targets  define  the  columns  of  the  matrix  and  the  pos¬ 
tulated  hypotheses  define  the  rows.  The  entries  of 
(4)  the  matrix  represent  measurements.  Then  for  a  given 
row  (hypothesis) ,  the  column  numbers  and  the  measure¬ 
ments  in  the  associated  columns  specify  the  target- 
measurement  pairs  posulated  by  that  hypothesis.  A 
typical  hypotheses  matrix  Is  shown  In  Fig.  1.  The  'O' 
entries  indicate  that  the  target  Is  not  detected. 


The  marginal  density  for  jc(k)  can  then  be  computed 
recursively  from 


TARGET  NUM8ER 
12  3 


p[x(k)|itk]  «  l  p(.x(k)|qk,  jtk)  p(qk|.zk} 
qk 


where 


(5) 


p[x(10|qk,  £k] 


I 

Ck 


p(£(k)|qk,  x(k),  £k_l) 


p(x(k)|qk,  £k*l) 


(6) 


To  summarize,  the  solution  of  the  hybrid-state 
estimation  problem  Involves  the  computation  of  the 
conditional  density  functions  in  Eq.  6  along  with  the 
probabilities  of  the  hypotheses  using  Eq.  4.  This  en¬ 
ables  the  computation  of  Che  posterior  probability  of 
x(k)  given  in  Eq.  5. 

For  the  multiobject  tracking  problem  at  hand,  if 
we  assume  that  the  dynamics  of  the  Individual  targets 
(and  measurement  process)  are  linear  and  toe  noise 
processes  are  Gaussian,  then  the  conditional  density 


*Such  hybrid  systems  have  also  been  referred  to  as 
Event-Driven  Dynamic  systems  or  Dynamic  Systems  in  a 
Switching  Environment  [8], 


Figure  1.  Hypotheses  Matrix  for  Target- 
Oriented  Approach 

In  the  alternate  approach,  referred  to  as  the 
'measurement  oriented  approach, '  the  roles  of  the 
targets  and  measurements  are  interchanged.  A  typical 
hypotheses  matrix  using  this  approach  is  shown  In  Fig. 
2.  Here  the  'O'  entries  denote  that  the  measurements 
corresponding  to  those  columns  are  assumed  to  be  false 
alarms . 
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MEASUREMENT  NUMBER 


1  2  3 


Figure  2.  Hypotheses  Matrix  for  Measurement- 
Oriented  Approach 

Both  approaches  have  drawbacks.  For  example,  in 
the  target-oriented  approach  Measurements,  not  Included 
in  a  row,  could  correspond  to  either  new  targets  or 
false  alarms;  this  cannot  be  shown  explicitly.  Simi¬ 
larly,  in  the  measurement-oriented  approach,  targets 
not  included  in  a  row  could  either  have  died  or  were 
not  detected.  The  hypothesis  matrix  cannot  display  it. 
Furthermore,  neither  of  the  approaches  can  account  for 
target  maneuvers. 

To  overcome  these  problems,  we  have  chosen  to  cre¬ 
ate  the  hypotheses  at  any  scan  in  a  novel  fashion  which 
is  also  intuitively  appealing.  Rather  than  represent¬ 
ing  the  hypotheses  in  the  form  of  a  matrix,  this  ap¬ 
proach  maintains  a  set  of  target  trees  and  a  list  of 
global  hypotheses.  The  root  of  each  target  cree  repre¬ 
sents  the  birth  of  the  target  and  the  branches  repre¬ 
sent  the  different  dynamics  that  the  target  can  assume 
and  the  various  measurements  it  can  be  associated  with 
in  subsequent  scans.  A  trace  of  successive  branches 
from  a  leaf  to  the  root  of  the  tree  corresponds  to  a 
potential  track  of  the  target.  The  leaf  cf  each  such 
trace  is  unique  and  is  referred  to  as  a  track  node  of 
the  target  trte. 

Each  element  of  the  global  hypotheses  list  con¬ 
tains  a  3et  of  pointers  which  point  to  track  nodes. 

They  represent  the  combination  of  track  nodes  postu¬ 
lated  by  the  global  hypothesis  which  that  element 
represents.  Obviously,  the  collection  of  pointers  in 
any  one  such  global  hypothesis  cannot  point  to  tvo 
track  nodes  within  the  same  target  tree. 

The  creation  of  the  global  hypotheses  using  target 
trees  and  global  hypotheses  list,  enables  the  decompo¬ 
sition  of  the  process  of  associating  targets  with  mea¬ 
surements  into  that  of  associating  measurements  with 
each  of  the  targets  and  then  forming  combinations  of 
the  resulting  tracks.  As  such,  we  refer  to  this  as  a 
Track-Oriented  approach.  The  expansion  of  the  individ- 
vidual  tracks  at  any  scan  can,  in  turn,  be  done  in  two 
stages.  First,  the  tracks  are  split  for  possible  dy¬ 
namics  and  next  these  tracks  are  associated  with  the 
measurements.  By  assuming  that  the  target  dynamics 
are  independent  of  the  measurement  characteristics, 
the  transition  diagrams  for  each  of  the  targets  and 
the  measurements  will  have  the  simple  form  described 
below. 

The  discrete  states  and  the  associated  transition 
diagram  for  a  single  maneuvering  target'  is  considered 
first.  The  target  starts  off  in  an  unborn  state  (B), 


is  born  at  some  scan  and  can  then  die  (X)  at  some  later 
scan.  A  target  that  is  in  the  born  state  can  have  a 
constant  velocity  (nonmaneuver  state  S)  or  be  acceler¬ 
ating  (maneuver  state  M).  To  allow  for  different  ac¬ 
celerations  that  the  target  can  undergo,  there  could  be 
several  maneuver  states  M^  (i-l,n«).  This  is  depicted 
in  Fig.  3  where  we  'nave  considered  the  case  where  n0«2. 
For  convenience  in  representing  the  transition  diagram, 
we  have  made  the  following  assumptions: 

1.  The  target  is  always  born  into  the  non¬ 
maneuver  state. 


2.  The  probability  of  transitioning  to  any 
of  the  born  states  or  X  is  Independent 
of  the  current  bo.n  state  of  the  target. 


R-ua? 


Figure  3.  Transition  Diagram  for  Target 
Dynamics  States 

The  transition  diagram  associated  with  the  measurement 
process  is  shown  in  Fig.  4.  Observe  that  the  probabil¬ 
ity  of  transitioning  to  either  state  is  independent  of 
the  prior  state.  To  prevent  the  existence  of  targets 
that  have  never  been  detected,  we  assume  that  a  target 
that  is  born  in  the  current  scan  will  be  detected.  An 
alternate  way  of  defining  this  requirement  is  to  define 
the  number  of  births  parameter  (in  the  distribution 
assumed  for  births)  conditioned  on  the  event  Chat  it 
will  be  detected.  This  also  implies  that  the  number 
of  births  conditioned  on  the  event  that  it  will  not  be 
detected  13  assumed  to  be  zero. 


Figure  4.  Transition  Diagram  for 
Measurement  States 

Now  we  can  depict  the  construction  of  the  global 
hypotheses  in  any  scan.  As  mentioned,  the  track  nodes 
of  all  target  trees  are  extended  in  two  steps. 


166 


In  step  1  the  track  node  Is  split  Into  several 
ranches  -  to  account  for  each  of  the  several  dynamic 
>dels  that  the  target  can  assume.  This  Is  shown  In 
lg.  5.  Obviously,  a  parenc  track  node  that  corre- 
>onds  to  a  dead  track  is  not  split;  only  a  continua- 
lon  of  the  dead  status  is  shown  in  this  case. 


PARENT 

TRACK 

NODE 


S 


X 


DESCENDED 
TRACK  NODES 
WHICH 

ACCOUNT  FOR 
POSSIBLE 
TARGET 
OYNAMICS 


■- 1273 


Figure  5.  Track  Splitting  to  Account  for 
Different  Target  Dynamics 


In  step  2,  the  extended  track  nodes  (excluding 
rhose  which  correspond  to  dead  tracks)  are  associated 
«ith  the  measurements  received  in  that  scan.  New  track 
lodes  are  also  generated  to  account  for  the  possibility 
if  a  missed  detection.  Hence,  if  there  are  nr  returns 
In  the  scan,  then  each  of  the  track  nodes  will  have 
(l  +  nr)  descendents.  We  have  extended  the  tree  in 
Fig.  5  to  illustrate  the  effect  that  step  2  has  on  the 
track  splitting  process  for  the  case  where  nr  “  2  (Fig. 
6).  It  is  easy  to  see  that  for  the  general  case,  the 
number  of  track  descendents  for  a  maneuvering  target  is 


[l  v  (l+naXHnr)]  (7) 
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Figure  6.  Tracking  Splitting  to  Account  for  Dif¬ 
ferent  Target  Dynamics  and  Different 
Measurement  Associations 


Now  we  form  all  combinations  of  track  nodes ,  which 
are  descendents  of  parent  track  nodes  included  in  the 
parent  global  hypothesis  list,  with  the  restriction 
that  no  two  track  nodes  Included  in  a  new  global  hy¬ 
pothesis  list  should  have  the  same  parent  noae  or  use 
the  same  measurement  from  the  current  scan.  We  can 
show  that  for  a  parent  global  hypothesis  which  postu¬ 
lates  the  existence  of  nj  tracks ,  the  number  of  descen¬ 
ded  global  hypotheses  is 


ain(n»,nr)  , 

/  „  /  \i  (n 

Jo  ('*?)  (■*"-)  • 


i  (n,  +  nt  -  2i) 


nr!  nt! 


(nt-i)!  (nr-i) !  i! 


(8) 


where  as  defined  before 

Rgg  «  number  of  possible  maneuvers 
nr  -  number  of  returns  in  current  scan 
Likelihood  Computation: 

The  likelihood  of  any  descendent  global  hypothesis 
has  been  shown  to  be 


p[qk|zk]  ■  —  [P(qk|qk“^ ,  *k~l)  p(z(k)|qk,  zk~!) 
i  ~  Ck  i  1  ~ 


F(qk-l|,k-l)] 


(9) 


where  the  subscript  1  denotes  the  specific  hypothesis. 
Since  the  likelihoods  are  used  as  a  basis  for  comparing 
the  various  global  hypotheses,  we  can  Ignore  the  denom¬ 
inator  -  it  being  the  same  for  all.  The  first  term  in 
the  numerator  represents  probabilities  of  transition¬ 
ing  from  the  parent  global  hypothesis  to  each  of  the 
descended  global  hypotheses.  Posterior  likelihoods  of 
these  tracks,  after  associating  them  with  the  different 
measurements  available  in  the  scan,  are  represented  by 
the  second  term.  Finally,  the  laat  term  is  the  likeli¬ 
hood  of  the  parent  global  hypothesis. 

If  the  likelihood  of  a  false  alarm  is  normalized 
to  unity,  the  remaining  measurement  association  likeli¬ 
hoods  can  be  scaled  accordingly.  In  such  a  case,  we 
need  only  consider  the  likelihoods  for  the  track  nodes 
shown  in  Fig.  6  for  each  of  the  targets.  This  makes 
it  possible  to  compute  the  likelihood  of  a  descendent 
global  hypothesis  following  the  same  steps  used  for 
constructing  it. 

The  state  transition  diagram  for  the  target  dy¬ 
namics  (Fig.  5)  defines  the  transition  probabilities 
between  different  target  states.  The  posterior  likeli¬ 
hoods  of  the  measurement  associations  can  be  obtained 
from  a  Kalman  filter  after  being  premultiplied  by  the 
probability  of  detection  Pp.  The  tracks  which  are  pos¬ 
tulated  as  being  missed  are  multiplied  by  (1-Pp)  only. 
Thus,  the  likelihoods  of  each  of  the  descendent  track 
nodes  can  be  computed.  Then,  after  the  proper  descen¬ 
dent  track  nodes  have  been  selected,  the  likelihood  of 
the  descendent  global  hypothesis  can  be  computed  as  a 
product  of  the  likelihood  of  the  parent  global  hypoth¬ 
esis  and  the  likelihoods  of  all  the  descendent  track 
nudes  Included  in  it. 


4.  Suboptlmal  Techniques 

The  main  purpose  for  designing  suboptlmal  tech¬ 
niques  Is  to  reduce  the  computational  burden  associated 
with  the  optimal  algorithm.  Within  the  context  of  the 
optimal  algorithm  that  we  have  constructed  above,  the 
computational  burden  can  be  reduced  by  either  discard¬ 
ing  some  of  the  unlikely  global  hypotheses  or  using 
some  computationally  simpler  algorithm  for  estimating 
the  continuous-valued  states.  We  will  discuss  only 
the  former;  several  standard  suboptlmal  techniques  for 
the  latter  can  be  found  in  the  literature  (e.g.,  a-g 
tracker,  constant  gain  Kalman  filter). 

Techniques  available  for  reducing  the  number  of 
global  hypotheses  can  be  grouped  into  one  of  the 
following 

1.  Screening 

2.  Pruning 

3.  Merging 

4.  Clustering 

Both  screening  and  pruning  use  the  likelihoods  to  de¬ 
termine  whether  hypotheses  may  be  discarded.  Merging 
corresponds  to  the  process  of  combining  similar  hypoth¬ 
eses.  Grouping  hypotheses  in  order  to  process  them 
independently  is  referred  to  as  Clustering.  Since  the 
optimal  algorithm,  described  in  Section  3,  constructs 
the  global  hypotheses  in  two  stages,  these  hypotheses 
reducing  techniques  can  be  applied  during  either  the 
track  expansion  process  or  the  global  hypotheses  build¬ 
ing  stage. 

Screening  techniques  prevent  less  likely  hypoth¬ 
eses  flora  being  formed  or  discard  them  after  they  are 
partially  formed.  We  nave  incorporated  several  such 
options  in  the  optimal  algorithm.  The  first  one  is 
that  of  creating  gates  around  track  nodes  and  testing 
whether  a  measurement  falls  within  this  gate  prior  to 
forming  a  new  descendent  track  node.  Since  the  gate 
sizes  tend  to  be  large  at  the  time  of  track  initia¬ 
tion,  an  additional  screening  option  has  been  provided. 
This  is  to  rot  initiate  maneuvers  in  target  dynamics 
until  its  track  is  "well  established."  By  well  estab¬ 
lished  tracks  we  mean  tracks  for  which  the  velocity  un¬ 
certainty  is  below  a  certain  threshold.  This  screen¬ 
ing  technique  will  prevent  the  inclusion  of  tracks 
which  postulate  maneuvers  as  a  consequence  of  the  large 
gate  sizes  at  the  time  of  track  initiation.  If  a  tar¬ 
get  were  to  maneuver  at  the  time  of  birth,  it  will  be 
picked  up  as  a  new  target  with  little  loss  in  informa¬ 
tion  caused  by  dropping  its  previous  track. 

Two  other  screening  options  that  we  have  intro¬ 
duced  are  based  on  Che  physical  limitations  of  the  tar¬ 
get.  One  takes  into  account  the  finite  velocities  that 
a  target  can  have;  the  initial  uncertainty  of  velocity 
states  has  been  chosen  to  reflect  this.  The  other  op¬ 
tion  takes  into  account  the  finite  accelerations  that 
are  feasible  for  a  target;  we  have  restricted  the  tar¬ 
get  from  executing  several  different  maneuvers  in  suc¬ 
cession.  In  terras  of  the  transition  diagram  shown  in 
Fig.  3,  this  restriction  implies  that  once  a  target 
enters  a  maneuver  state,  it  can  either  remain  in  that 
state  or  return  Co  the  constant  velocity  state. 

Pruning  techniques  discard  hypotheses  after  they 
arc  formed.  It  can  he  affected  in  two  ways:  either 
deleting  hypotheses  which  have  a  likelihood  below  a 
certain  threshold  or  by  limiting  the  global  hypotheses 
at  any  stage  to  a  fixed  number.  The  former  is  diffi¬ 
cult  to  design  since  the  threshold  will,  in  general,  be 


time  varying.  The  latter  technique  is  simpler  to  im¬ 
plement  since  there  are  no  thresholds  to  be  designed. 
Ue  have  Incorporated  the  second  option  into  our 
algorithm. 

5.  Simulation  Results 

Due  to  the  large  computational  requirements  of 
the  optimal  algorithm,  it  is  not  feasible  to  run  any 
test  scenario  for  more  than  2  or  3  scans.  Hence,  we 
have  run  the  algorithm  with  both  screening  and  pruning 
options,  discussed  in  Section  4,  in  effect.  Two  test 
cases  that  were  simulated  are  described  below. 

Test  Case  I 

Ue  have  considered  Che  simple  case  of  a  single 
target  having  the  trajectory  shown  (indicated  by  the 
continuous  line)  in  Fig.  7.  The  target  starts  with  a 
heading  of  30°.  At  scan  5,  it  executes  a  maneuver 
(-30°  turn)  and  thereafter  maintains  a  heading  of  0°. 
We  have  generated  clutter  at  every  scan  represented 
by  Os).  Figure  7  indicates  the  location  of  clutter 
at  each  scan. 


■ —  X  (tot)  MW 

Figure  7.  Test  Case  1,  Rank  1  Global  Hypothesis 

We  have  summarized  the  simulation  parameters  in 
Table  1  and  the  tracking  algorithm  parameters  in 
Table  2.  The  heuristics  that  have  been  used  to  reduce 
the  computational  requirements  are  given  in  Table  3. 
Using  the  measurement  noise  specified  in  Table  1,  it 
can  be  shown  that  the  uncertainty  in  the  velocity  es¬ 
timates  will  be  reduced  to  less  than  10  m/sec  after  5 
scans.  This  ensures  that  the  heuristic  that  initiates 
maneuver  hypotheses  only  after  tracks  are  well  estab¬ 
lished,  will  postulate  maneuvers  for  the  target  prior 
to  the  actual  maneuver  at  scan  5. 

Figure  7  also  shows  the  trajectories  postulated 
by  the  global  hypothesis  with  the  highest  likelihood 
(highest  rank).  It  can  be  seen  that  it  identifies  the 
correct  target  trajectory  (dotted  line  through  the  As). 
However,  it  postulates  the  existence  of  another  target 
(dotted  line  through  the  Os)  that  is  born,  detected, 
not  detected  and  dead  in  successive  scans  starting  with 
the  fourth  scan.  This  is  a  consequence  of  the  large 
gates  associated  with  the  targets  that  are  just  born. 

At  scan  9  such  a  track  is  insignificant  and  hence  can 
be  ignored. 
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TABLE  1.  SIMULATION  PARAMETERS 


Scan  Tlae  T:  10  sees. 

Number  of  Scans:  9 

Surveillance  Area:  -15000  <  x  <  15000 
-15000  <  y  <  15000 
Measurement  Noise:  ox  “  30  ■ 


Oy  *  300  a 

Initial  Velocity  of  Target: 
Target  Velocity  after  5th  Scan: 


(Speed:  300  m/'s  ,  30°  heading) 
(Speed:  300  a/s  ,  0*  heading) 


Clutter:  1  per  scan 
Uniform  between 


(xt(k)  -  1500)  and  (xt(k)  +  1500) 
(yt(k)  -  15000)  and  (yt(k)  +  15000) 


TABLE  2.  PARAMETERS  USED  IN  TRACKING  ALGORITHM 


Initial  Filter  Covariance:  Diag  [pu»  P22,  P33«  P4I»] 

p  ,  p22:  Set  based  on  position  measurement  uncertainty 

p  1/2  m  p  1/2  «  200  a/ sec 
33  "*4 

Model  Uncertainty;  Diag  [q1Jt  q22,  q33,  q4l)] 

q  1/2  m  q  1/2  a  0 
11  22 

q  1/2  a  q  1/2  •  5  u/s  :c 
3i  44 

Measurement  Noise  Uncertainty 
ox  -  30  a 
Oy  “  300  a 

Dynamic  Model  of  Target 


T 

0 

cos  eB 
-a,  sin  8„ 


0 

T 

an  sin  6„ 

an  cos  8a 


an  e  (1) 

80  e  (-30°,  0*.  +30*} 

Temporal  Distribution  for  Births:  Poisson  with  Ag  “  1E“5 

Temporal  Distribution  for  False  Alarms:  Pulsson  with  Apg  *  4.5E-10 

Probability  of  Detection:  0,998 

Probability  of  Death:  2E-4 

Probability  of  No  Maneuver:  0.8 

Probability  of  Maneuver:  0.2/n,j 
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TABLE  3.  HEURISTICS  USED 


1.  Gating:  10 

2.  Number  of  global  hypotheses  retained  at 
each  scan:  10 

3.  Max •  number  of  missed  detections  permitted 
for  a  track:  2 

4.  Maneuver  hypotheses  initiated  only  for 
well-established  tracks  fot  which: 

p  l/2  <  15  m/sec 
33 

p  l/2  <  15  m/sec 

44 

5.  After  maneuver  is  initiated,  only  transi¬ 
tions  permitted  are  either  etraight  line 
or  same  maneuver  state. 

6.  A  priori  information  about  target  position  at 
birth  is  ignored,  l.e.,  target  position  is 
initialized  based  on  measurement  data  only. 


)  l /%  m  q 

x  (0)  -  x 

u 

X 

1  1 

.  1/2-0 

x  (0)  -  y 

22 

y 

2  7| 

He  have  shown  in  Fi^s.  8  and  9  the  trajectories 
postulated  by  the  global  hypotheses  with  ranks  2  and  3. 
It  can  be  seeen  that  the  rank  2  global  hypotheses  is 
the  correct  one  -  it  postulates  only  the  correct  tra¬ 
jectory.  The  rank  3  global  hypothesis  is  almost  iden¬ 
tical  to  Che  rank  1  hypothesis.  The  difference  is  that 
the  incorrect  track  is  postulated  to  die  at  scan  6. 


trajectory  for  the  target.  Due  to  the  modeling  of  the 
target  dynamics  in  discrete  time,  however,  some  of  them 
postulate  a  maneuver  initiated  at  scan  9.  Since  the 
position  of  the  target  will  be  Influenced  by  e  maneuver 
only  in  the  next  scan,  it  is  only  then  that  the  algo¬ 
rithm  will  reject  such  incorrect  hypotheses.  As  in  the 
case  of  global  hypotheses  with  ranks  1  and  3,  we  have 
observed  that  some  of  the  remaining  global  hypotheses 
postulate  incorrect  tracks  for  short  periods  of  time. 
Since  they  are  ephemeral,  they  do  not  have  any  adverse 
effect  on  the  correct  target  trajectory.  This  feature 
of  the  algorithm,  wherein  most  of  the  hypotheses  that 
are  retained  postulate  the  correct  trajectory  with  some 
minor  differences,  illustrates  one  aspect  of  the  ro¬ 
bustness  of  the  suboptlmal  algorithm. 
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Figure  8.  Test  Case  1,  Rank  2  Global  Hypothesis 

On  examining  the  remaining  global  hypotheses  which 
are  retained  by  the  algorithm  f which  are  not  shown 
here),  we  have  observeu  they  all  postulate  the  correct 


Figure  9.  Test  Case  1,  Rank  3  Global  Hypothesis 
Test  Case  2 

In  this  test  scenario,  we  simulated  two  crossing 
targets  along  with  clutter.  The  targets  cross  at  the 
same  point  in  time.  At  that  time  one  of  the  targets 
executes  a  maneuver  too.  As  in  the  last  scenario, 
clutter  is  generated  close  to  targets.  The  true  target 
trajectories  are  Indicated  by  continuous  lines  and  the 
clutter  is  Indicated  by  Ps.  This  is  shown  in  Fig.  10. 
The  parameters  for  the  simulation  and  the  algorithm  are 
the  same  as  in  Test  Case  1  with  the  following  addition. 


Target  2  -  Initial  Position 
(-9657,  -5264) 

Speed  .-.d  Heading 
(200  m/s,  -45*) 


Ic  can  be  observed  that  the  conditions  are  partic¬ 
ularly  severe  for  the  tracking  algorithm  at  the  target 
crossing  point  where  target  1  executes  the  maneuver. 
Figure  10  traces  the  trajectories  posulatcd  by  the 
global  hypothesis  with  the  highest  likelihood  (rank  1). 
Despite  the  exactirg  requirements  of  the  scenario,  the 
algorithm  Identifies  the  correct  hypothesis  by  the  9th 
scan.  From  these  two  test  cases,  we  see  that  the  algo¬ 
rithm  performs  very  well  in  spite  of  the  heuristics 
that  have  been  introduced. 
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Figure  10.  Teat  Cate  2,  Rank  1  Global  Hypothesis 
6.  Summary  and  Conclusions 

By  formulating  the  problem  of  multlobject  tracking 
of  mareuverlng  targeta  within  the  framework  of  hybrid 
atate  estimation  and  using  a  novel  data  structure  to 
represent  the  discrete-state  hypotheses,  we  have  con¬ 
structed  t;r>  optimal  solution.  We  have  provided  alrau- 
larlon  results  which  show  the  feasibility  of  construct¬ 
ing  the  optimal  algorithm.  Since  the  optimal  solution 
requires  exponentially  growing  storage  and  computa¬ 
tional  i equlrements ,  we  have  incorporated  suitable  hy¬ 
potheses  deletion  techniques.  Such  suboptlmal  tech¬ 
niques  drastically  .educe  the  computational  require¬ 
ments  with  little  lo38  In  accuracy,  as  has  been  shown 
In  th-  simulation  results. 


here  defines  the  framework  for  implementing  algorithms 

for  state  estimation  in  a  switching  environment. 
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ABSTRACT 

■"*  This  paper  describes  a  system  for  tracking  and 
classifying  multiple  targets  using  measurements  from 
multiple  sensors.  This  system  vat  implemented  based  on 
a  general  Bayesian  theory  of  multitarget  tracking  which 
the  authors  developed  earlier.  The  related  implementa¬ 
tion  issues  in  multitarget  tracking  are  also  discussed. 

1.  INTRODUCTION  ' ! 

In  (11  and  (2]  ve  provided  solutions  to  several 
problems  which  hrd  pot  b'  ;n  solve!  in  the  existing  mul- 
titarget,  multiseusor  tr.  king  literature  (surveyed  in 
(31,  (4],  [3],  (61,  etc.).  The  problems  were:  (a) 
treatment  of  non-gaussian  target/sensor  models,  (b) 
dependence  of  detection  on  target  states,  (c)  determi¬ 
nation  of  the  likelihood  of  newly  detected  targets  (how 
to  initiate  tracks),  and  (d)  dependence  among  targets. 

As  indicated  in  (7),  (8j,  etc.,  the  first  problem  was 
less  difficult  than  the  others  which  are,  however, 
obviously  important  in  practice.  In  the  process  cf 
solving  these  problems,  we  realized  that,  despite  the 
many  algorithms  developed  in  a  vast  amount  of  existing 
literature,  there  did  not  exist  either  a  general  foun¬ 
dation  of  the  subject  or  a  unified  view  on  it.  An 
attempt  was  made  in  (9)  and  ilO]  to  fit  multitarget 
tracking  problems  into  a  subcategory  of  a  special  class 
of  dynamical  systems. 

In  (11  ana  (2],  we  presented  a  view  which  shcrply 
contrasts  with  this.  According  to  our  view,  many  ele¬ 
ments  involved  in  the  multitarget  tracking  problems 
ace  in  essence  random  sets  as  defined  in  (111,  and 
hence  the  problems  are  radically  different  from  the 
conventional  iiltering/estimation  problems.  For  example, 
targets  constitute  a  random  set  because  (in  general) 
the  number  of  targets  is  unknown,  and  there  is  no  g. 
priori  labelling.  The  returns  from  tensors  are  alto 
random  sets  because  of  the  random  number  of  the  returns 
and  the  random  ordering.  We  tentatively  call  such 
features  of  objects  random-set  nature.  Moreover,  as 
will  be  seen  later,  many  elements  (such  as  tracks  and 
hypotheses)  are  veil  understood  when  they  are  connected 
to  certain  random  sets. 

The  theory  on  random  sett,  or  stochastic  geometry, 
is  mainly  concerned  with  those  whose  realizations  are 
uncountable  seta,  e.g.,  open,  closed  or  convex  sets  id 
Euclidean  spaces,  and  is  mathematically  vety  sophisti¬ 
cated.  Fortunately,  in  the  multitarget  trackicg  prob¬ 
lems  we  are  excl.isively  concerned  with  random  sets 
whose  realizations  are  finite  (or  at  most  countable) 
sets.  Therefore  we  still  can  apply  standard  proba- 
biliatic  techniques.  Fur  example,  a  finite  random  set 
of  reals  can  be  probabilistically  completely  described 
by  specifying  the  probability  on  the  number  N  of  ele¬ 
ments  and  the  joint  probability  distributions  of  ele¬ 
ments  given  N's.  However,  in  order  to  model  the 
random-set  nature,  we  must  require  that  each  of  the 
•This  reseatch  was  supported  by  the  Defense  Advanced 
Research  Projects  Agency,  Contract  MDA-9O3-81-C-0333. 
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distributions  it  interchangeable  (permutable) .  In 
other  words,  ue  can  treat  a  random  set  as  a  random  ele¬ 
ment  in  a  disjoint  union  of  direct-product  spaces  with 
different  dimensions.  At  a  natter  of  fact,  this  was 
thu  bssic  approach  tsken  in  (1)  and  (2l.  For  this  rea- 
soi,  the  bssic  objects  sre  generally  swdeled  as  s  pair 
consisting  of  the  number-of-elenents  and  a  vector  with 
the  corresponding  dimension. 

The  main  purpose  of  this  paper  is  to  describe  a 
general  multitarget  tracking  system  (called  Generalized 
Tracker/Clsssifier  —  GTC)  which  we  implemented  bated 
uj on  our  general  theory  developed  in  11]  and  (2]  and  to 
compare  this  system  with  other  existing  algorithms.  In 
the  following  three  sections,  we  will  briefly  overview 
our  theoretical  results.  (One  should  refer  to  (1)  or  (2] 
for  a  more  rigorous  treatunt.)  Then  we  discuss  the 
implementation  issues  and  compare  the  GTC  implementa¬ 
tion  with  bther  algorithms. 

2.  GENERAL  MODEL 

A  target  it  a  generic  name  for  any  object  to  be 
tracked  and/or  classitied,  and  a  sensor  is  a  generic 
name  for  a  device  which  generates  an  undetermined 
number  of  measurements.  In  our  general  model,  targets 
are  modelejd  as  one  ertity,  i.a.,  a  pair  of  the  two  ran¬ 
dom  elements:  a  nonnegative  random  integer  N 
representing  a  constant  but  unknown  number  of  targets 
in  the  "whole  world,"  and  a  continuous-time  Markov  pro¬ 
cess  X(t).i  The  Markov  process  X(t)  is  on  different 
spaces  depending  on  the  number  of  targets  N_.  There¬ 
fore,  the  targets  are  modeled  as  an  abstract  stochastic 
process  (X(t),N^),  called  target  system  state  on  the 
countable  disjoint  union  of  spaces,  which  we  call  tar¬ 
get  system  state  space ■  Each  component  of  the  target 
system  state  space  corresponds  to  a  given  number  of 
targets  and  consists  of  (is  the  direct  product  of)  two 
spaces:  the  common  component  apace  (to  which  a  common 
state  to  all  the  targets  in  s  group  stay  be  assigned) 
and  the  individual  component  space.  To  account  for  the 
random-sit  nature  of  the  targets,  the  latter  space  must 
be  an  N^-direct-product  of  an  identical  space  and  the 
Markov  process  X(t)  must  oe  interchangeable  with 
respect  to  the  ordering  (labelling)  of  targets.  Any 
component  of  the  target  system  state  space  is  in  gen¬ 
eral  a  hybrid  set,  i.e.,  the  direct  product  of  a  subset 
of  a  Euclidean  space  (used  to  represent  "continuous" 
information  such  as  position  and  velocity)  and  a  finite 
set  (used  to  represent  "discrete"  information  such  as 
type,  aaneuvering/operating  mode,  etc.). 

One  should  not  confuse  the  constsnt  (but  unknown) 
number  of  targets  with  that  of  detected  targets,  and 
should  note  that  the  constancy  assumption  is  not  res¬ 
trictive  but  correctly  reflects  the  reality  at  least 
conceptually.  For  example,  consider  targets  whose 
states  reflect  their  birth-death  processes  (as  modeled 
in  (8)).  In  reality,  the  interval  between  the  time 
when  the  first  target  it  borr.  snd  the  time  when  the 
last  target  diet  it  finite.  The  total  number  of  tar¬ 
gets  which  do  exist  in  subintervals  is  hence  constant 
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and  finite.  In  a  sense,  the  co.icept  of  existence 
is  independent  of  the  concept  of  time.  In  practice, 
however,  we  may  have  some  difficulties  in  reflecting  3 
“constant'*  flow  of  targets  into  a  surveillance  region. 
In  almost  all  cases,  we  can  resolve  this  difficulty  by 
choosing  appropriate  a  pr ior i  initial  distributions  of 
target  states. 

In  general,  we  assume  a  multiplicity  cf  sensors, 
each  of  which  generates  sets  of  measurement s  (returns) 
at  a  finite  rate.  Each  6ensor  output  13  again  a  pair 
(y.K  )  of  two  random  element:.,  the  number  N  of  meas¬ 
urements  and  a  vector  y  of  measurements,  and  is  a  ran¬ 
dom  element  in  a' disjoint  union  of  direct-products  of 
identical  measurement  spaces .  Each  sensor  may  how¬ 
ever  have  a  different  measurement  space  which  is  in 
general,  a  hybrid  set  to  account  for  feature-type 
discrete  information  as  well  as  cont: nuous-value  obser¬ 
vations.  When  a  sensor  s  generates  an  output  (y,N  )  at 

time  t,  we  call  (y,Nu,t,e)  a  data  set  and  (t,s)  a  data 
n 

set  index . 

Without  loss  of  generality,  ve  can  assume  that  any 
sensor  generates  at  most  one, output  at  a  time.  One  of 
the  key  steps  in  our  development  is  to  model  a  mechan¬ 
ism  which  relates  the  target  system  state  (state-of- 
vorld)  to  the  sensor  outputs  (observables).  The  first 
assumption  is  that  the  sensor  source  s  and  the  timing  t 
of  its  output  arc  exactly  known  and  are  independent  of 
the  targets.  Therefore,  even  when  uata  set  indices 
(t,s)'s  are  random,  we  can  treat  them  as  constant  and 
known  quantities,  i.e  ,  all  the  sensor  schedules  are 
predetermined.  Let  J(  be  the  set  of  all  the  data  set 
indices  (t,s)'s. 


is  the  total  amount  of  information  accumulates  at  k. 

Our  problem  is  to  calculate  the  probability  distribu¬ 
tion  of  the  target  system  state  (x(t),N_)  given  the 
cumulative  date  set  z'  up  to  k  *  (t,s)  in  J(,  in 
terms  of  (i)  P(x(t )  |NT,Z  U3 )  and  (ii)  P(HT|ZU,>. 

Since  the  origin  of  each  measurement  in  each  data  set 
is  random  and  there  is  no  a  or ior i  labelling,  we  cannot 
directly  calculate  such  conditional  probability  (i) 
described  above.  Therefore  we  must  "hypothesize"  the 
origin  of  each  measurement,  e"aluate  each  “hypothesis" 
and  calculate  the  target  system  state  conditional  dis¬ 
tribution  given  each  hypothesis,  which  is  in  fact  a 
common  procedure  in  multitarget  tracking,  hypotheses 
as  well  as  tracks  are  the  moat  frequently  used  termino¬ 
logies  but  often  only  vaguely  defined.  Our  definition 
of  hypotheses  and  tracks  ia  closeiy  related  to 
Morefield's  notion  in  [ 1 2 J  but  diifers  from  it  in  that 
ours  are  defined  exclusively  on  tie  number-of- 
measurements  information. 


(k) 

Let  k  be  a  data  set  index  in  X  and  J  be  the 

union  of  all  the  sets  (1 . N^(k" ) }X{k' }  for  all  k'  up 

to  k  where  N  (k')  is  the  number  of  measurements  in  the 

"  (k) 

data  set  indexed  by  k"  .  Each  element  (j,t,s)  in  J 

represents  the  j-th  measurement  in  the  data  aet, gen¬ 
erated  by  sensor  s  at  time  t.  The  random  set  J'  'it 
called  the  cumulative  measurement  index  up  to  k.  A 
track  at  k  is  a  subset  of  the  cumul.'.ive  measurement 
index  set  up  to  k  and  a  hypothesis  a.,  k  ia  a  collection 
of  nonempty  tracka  at  k.  A  track  it  aaid  to  be  possi¬ 
ble  if  it  contains  at  most  one  measurement  index  in  one 
data  set.  A  hypothesis  ia  said  to  be  poaaible  it  it 
contains  only  possible  tracka  and  no  .wo  tracks  in  it 
intersect.  Define  a  random  set  A(k.  by 


With  the  standacl  assumption  of  no  split  measure¬ 
ments  and  no  merged  measurements,  for  each  k  »  (t,s)  in 
J( ,  sensor  output  (y,NM)  is  modeled  as  follows:  We 
assume  a  random  subset  I  of  the  target  index  set 
X  »(1,...,NT>  such  that  »( Ip)  <  N  (In  this  paper,  #(E) 
is  the  number  of  elements  in  a  set  E.)  and  a  random 


one-to-one  function  A  defined  on  I„  taking  values  in 
the  set  J  “(1, . . . ,N„).  The  random"  subset  1^  is  called 
the  (index)  set  of^  detected  target s  and  the  random 
function  A  is  called  assignment  function  at  k.  i€Ip  and 
j-A(i)  mean",  that  the  i-th  target  is  detected  at  k  and 
the  j-th  measurement  originates  from  it.  When  j  in  J,. 
is  not  in  the  image  of  A,  we  call  the  j-th  measurement 


false  alarm. 


Finally,  we  assume  that  every  data  set  is  condi¬ 
tionally  independent  given  the  target  state.  With  this 
assumption,  we  can  completely  describe  the  sensor  model 
by,  for  each  (t,B)  in  J(  ,  specifying  ti)  the  probabil¬ 
ity  of  the  random  subaet  I.  of  1^  given  N^.,  (ii)  the 
probability  distribution  of  the  number  N^  of  measure¬ 
ments  (or  equivalently  the  number  of  false  alarms) 
and  (iii)  the  probability  density  function  of  the  meas¬ 
urement  vector  given  1^,  N1(  and  the  random  assignment 
A.  All  of  the  above  probabilities,  ditributior.s  and 
densities  are,  in  general,  conditioned  by  the  target 
system  state.  In  particular,  detection  of  a  target  may 
depend  on  its  state.  To  reflect  the  raudoa-set  nature, 
each  of  the  conditional  probabilities,  etc.,  in 

the  above  (i)  ~  (iii)  must  be  interchangeable  with 
respect  to  the  ordering  of  the  targets. 


3.  GENERAL  BAYESIAN  FORMULA 


Let  2  be  the  set  of  all  the  data  sets  and  ,X  be 
the  data  set  index  set.  Since  evr cy  sensor  generates 
outputs  at  a  finite  rate  and  it  sends  at  most  one  data 
set  at  a  time,  J(  is  at  most  countable  and  is  one-to- 
one  to  2  .  ...  can  be  enumerated  in  accordance  with 

time  in  such  a  way  that  .X  and  2  are  totally  ordered 
so  that  the  time  components  are  in  a  natural  order^j 
Thus,  for  each  k  in  X,  the  cumulative  data  set  Z'  up 
to  k  is  the  collection  of  all  the  data  sets  up  to  k  and 


A(k>  -  {{(A(k')(i),k')lk-£Klk).i«lDtk'))li<l„k))  (1) 

where  K(k)  is  the  set  of  data  set  ind.cet  prior  to  and 
including  k,  IQ(k')  is  the  set  of  detected  targets  in 
the  data  set  indexed  by  k' ,  A(k' )  ia  the  random  assign- 

(k) 

ment  in  the  same  data  set,  and  1^  is  the  cumulative 
set  of  detected  targets  up  to  k,  i.e.,  the  union  of  all 
the  Ip(k')  tor  k'  in  K(k).  Due  to  our  no-split/no- 
merge  assumption,  it  is  clea-  that  the  summation  of 

Prob.<  A(k)*A  IJ  )  over  all  the  possible  hypotheses 
X  at  k  must  be  one.  When  )  is  a  possible 

hypothesis,  in  the  event  (  A(k)“ X },  there  are  n  tar¬ 
gets  which  are  detected  at  least  once  up  to  k  and  all 
the  measurement  indices  in  each  T.  originate  from  the 
i-th  detected  target. 

Since  the  righ'.-hand  side  of  (1)  is  a  aet  opera- 
(k) 

tion  and  the  random  set  Ip  is  not  accessible,  we  must 

'hypothesize"  the  origin  (:u  Ij)  of  each  element  in 
A(k),  i.e.,  a  one-to-one  random  function  ft(k)  from 
A(k)  into  1_.  Therefore,  we  must  calculate  (i) 

P(N  I  A(k),Z*k’)),  (ii)  P(  ft  ( k )  I N  ,  A(k/,Z(k))  and  (iii) 

A  /  y \  A 

P(X(t)l  ft(k),N^,  A(k),Zl  ').  Due  to  our  interchangea¬ 
bility  condition,  however,  ve  can  show  that  the  proba¬ 
bility  (ii)  is  the  same  for  all  the  possible  realiza¬ 
tion  of  ft  (k)  and  that  the  state  distribution  (iii)  is 
invariant  under  any  realization  of  ft(k).  Thus,  we 
need  only  calculate  (i)  and  (iii),  which  can  be  done 
recursively  as  shown  below:  Let  k  ■  (t,s)  be  any  data 
set  index  in -X  ,  (y,N  k)  the  last  (current)  data  set 
"  (k) 

in  the  cumulative  data  set  Z“Z  and 

Z*.Z^*^«Z \((y,NH,K)),  where  k  *  (t,s)  in  ,X  is  the 

immediate  predecessor  of  k  and  \  is  the  set  substrac- 
tiun  o-eration.  Then,  with  A»  A(k)  and 

A={  t  ,  k^|l£A)\{'>},  vie  have 
M 
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P(A|Z) 


where  fi  it  to  tppropritte  Betture  on  the  measurement 
value  apace  of  aenaor  a. 


P(AIZ)  (Nm-  nD(Alk))l 
P(2  |Z )  Nu  ] 


2,  (N_  -  f(A) )  I 

>  — -  P(NTIA,Z)X(y,NjNT,A) 

(N_  -  #(A))I  M  T 

-♦(A)  T  (2) 


Nj-PU) 


where  n^iXlk)  ia  the  number  of  detected  targeta  at  k 
when  hypotheaized  by  X  and  X(y.NMlbT,X)  it  the  likel¬ 
ihood  of  the  current  aenaor  output  (y.N^)  under  assump- 
tion  (N^.X).  The  actual  form  of  X  ia  rather  compli¬ 
cated  and  ia  omitted  in  thia  paper  (Refer  to  [1]  or 
[2]).  Then,  to  complete  the  recuraion,  the  target  sys¬ 
tem  atate  diatribution  conditioned  on  (Z'  ',  A(k), 
ft(k))  mutt  be  calculated  at  well  at  the  conditional 
probability  on  accoroing  to  the  updating  formulae 
ahown  in  [1]  and1 [2]. 

4.  INDEPENDENT,  IDENTICALLY  DISTRIBUTED  CASES 

Although  the  general  formulation  deacribed  in  the 
previoua  aection  givea  ua  a  sound  foundation  of  our 
theory  of  multit'.rget  tracking,  itt  implementation  may 
poae  a  terioua  problem  becauae  evaluation  of  hypotheaea 
involvea  all  the  poatible  pairt  (X  ,N^).  However,  with 
the  additional  independence  aaaumptiona  mentioned 
below,  we  can  reduce  the  general  formulr  to  a  form 
which  is  more  implementationally  feasible.  He  call 
modelt  which  satisfy  these  assumptions  independent  and 
identically  distributed  (i.i.d.)  models.  With  these 
assumptions,  applicable  models  are  more  limited;  for 
example,  we  cannot  allow  dependence  among  targeta. 
However,  thia  class  of  models  ia  still  general  enough 
to  include  any  model  in  the  existing  nultitrrget 
literature. 

First  we  assume  that,  given  the  total  number  N_  of 
targeta,  the  targets  arc  repreaented  by  a  system  of1 
i.i.d.  Harkov  processes  where  the  target  system  atate 
space  consists  only  of  the  individual  target  system 
space.  The  a  priori  diatribution  of  the  number  N_  of 
targets  ia  Poisson  with  mean  vQ.  The  event  (i€lD(k)) 
in  which  the  i-th  target  is  detected  in  the  data  aet 
indexed  by  k  -  (t,s)  it  target-wise  conditionally 
independent  and  depends  only  on  its  own  state  X.(t). 
Thus,  we  have  1 

Prob.U€ID<k)IX(t),NT)  -  pD(Xi<c)  Ik) 

with  a  given  detection  probability  function  p-( . |k) . 

We  should  note  again  that  p^C.lk)  is  a  conditional  pro¬ 
bability  so  that  the  probability  of  the  i-th  target 
being  detected  is  described  as 

Prob.U«  ID(k)lNT}  -  y‘pD(Xi(t)lk)p(dX.(t)lNI) 
which  one  should  not  contuse  with  p^(  I*;  itself. 

The  number  of  false  alarms  in  the  data  set  indexed 
by  any  k  ia  independent  of  the  target  states  and  of 
other  data  seta,  and  is  determined  by  a  given  distribu- 
tion  (.Ik)  for  each  k.  Given  the  aet  Ip(k)  of 

detecte£Atargets,  the  number  of  false  alarms  (hence, 
the  number  N (k)  of  measurements)  and  the  assignment 
function  A(k7,  the  system  of  measurements  values 
y-(yj)  j€(l  N  j  i»  target-wise  conditionally  indepen¬ 

dent  and,  in  particular,  the  system  of  false  alarm 
measurements  is  i.i.d.  with  a  common  density  p_, (,|k). 
When  j»A(k)(i),  the  probability  density  of  y.  is  given 
by  a  usual  state-measurement  transition  probability 

density  function  p  (y.lX.k),  i.e., 
n  J  i 

Prob. (y  .  <  dy  I  j»A(k) (i),X. (t))  »  p„(y . IX  (t),k)M  (dy) 

J  1  n  j  i  s 


With  the  above  additional  assumptions,  the  general 
formula  (2)  in  the  previous  section  can  be  reduced  to: 

P(AIZ)  -  C'WlZ)  L^fy.NjjlA.k)  L(Y(r,k)  |r,k) 
where  T<A  <3> 

C  -  P(Z|2)  (NJ)  exp(l»(k)(l  -  L(G !d,k) )  (4) 

is  the  normalizing  constant, 

LFA(y,NMIA,k>  -  (Nfa.)  p  <NFAlk)  p|  pFA(y.lk)  (5) 

*JFA 

is  the  false  alarm  likelihood  with  N  being  the  number 
of  false  alarms  and  JpA  being  the  set  of  false  alarms, 
and  Y(T,k)  is  the  measurement  assigned  by  (symboli¬ 
cally  Y(T,k)*0  if  r  is  not  assigned  any  measurement  at 
k).  Each  of  the  remaining  factors  L(ylr,k)  in  (3)  is 
generally  called  the  track-to-measurement  likelihood. 
They  are  defined  as: 


(i)  the  likelihood  of  measurement  y  originating 
from  a  target  detected  before: 

L(y  lT,k)  »  ypM(y|x,k)pD(xlk)p(dx|T,ZVk)  (6) 

(ii)  the  likelihood  of  a  target  detected  before 
being  undetected: 

LOlT.k)  -  y (1  -  pD(xIk))p(dx|r,2,k)  (7) 

(iii)  the  likelihood  of  measurement  y 
originating  from  a  newly  detected  target: 

L(ylt,k)  -  |/(k)  f  p  (ylx,k)pn(nlk)p(dxl0,Z,k) 

J  ”  D  (8) 

(iv)  the  likelihood  of  an  undetected  target 
remaining  undetected: 

L(0lf8,k)  -  J(,  1  -  pD(xlk))p(dx(sl,Z,k)  (9) 

In  (4)  and  (8),  l/(k)-E(N  |A,Z)  -  *(A)  is  the  expected 
number  of  undetected  targets  up  to  k.  In  (6)  ~  (9), 
p ( .  It, Z_, k )  is  the  target  state  distribution  at  t  given 
track  r  and  cumulative  data  set "z. 

-he  updating  from  l/(k)  to  v(k)  and  from  p(.|T,Z,k) 
to  p(.|T,Z,k)  can  be  done  by  »/(k)  -  L(0ld,k)5/(E)  and 


p(dx|T,Z,k)  - 


d  pM(y|x,k)pn(xlk)p(dx|r,Z,k) 
if  Y(r,k)-y*0 

d'  *(1  -  Pq(x Ik) )p(dx I T,Z ,k; 


otherwise 


with  normalizing  constant  d  or  d'. 

When  ve  impose  the  additional  assumption  that  the 
distribution  p  (.Ik)  of  the  number  of  false  alarm  is 
rA 

Poisson  with  mean  WFA<k)  for.  each  k  in  .X  ,  we  have  the 
following  simple  formula  by  repeatedly  applying  the 
recursive  formula  (3): 

P(AIZ)  -  c'1  p|  i(r,z)  (11) 

rf.‘. 


where  l(T\Z)  is  the  likelihood  of  track  T.  which  can  be 
recursively  calculated  by 

!M>  HI  -  e 

Z(TlZ')  L(ylT,k')/l\.(ylk' ) 

A  if  Y(r,k')“y/6 

Z(T|Z')  L(G I T,k' )  otherwise  (12) 

and 

t\ACy  Ik'  >  *  l>FA(ylk'  }PFA(y)  (13) 

tor  each  k' ,  Z'“Z^k  ^  and^'“Z^  ^  with  E'  being  the 
immediate  predecessor  of  k'. 

S.  GENERALIZED  TRACKER/CLASSIFIER 

Vie  have  implemented  a  general  algorithm  described 
in  the  previous  two  sections  in  the  form  of  a  system 
called  Generalised  Tracker /Classifier  (GTC).  GTC  is 
intended  to  be  used  in  performance  analysis  of  multi¬ 
target  tracking  systems  by  Monte  Carlo  simulation  and 
to  implement  all  the  problem-independent  parts  of  the 
general  algorithm.  At  the  current  stage,  GTC  can  only 
handle  models  for  which  all  the  assumptions  made  in 
Section  4  are  satisfied  in  addition  to  those  made  in 
Section  3.  Equ.  (3)  was  chosen  for  the  GTC's  basic 
equation.  The  functions  in  GTC  can  be  divided  into 
the  following  three  major  components: 

1.  hypothesis  generation 

2.  hypothesis  evaluation,  and 

3.  hypothesis  management 

In  hypothesis  generation  all  the  tracks  and 
measurements  are  first  cross-referenced  and  all  the 
likelihood  functions  appearing  Equation  (3)  are  calcu¬ 
lated.  These  functions  are  tabulated  into  a  table 
called  the  track-measurement  cross-r.:terer.ce  table. 

Then  all  the  old  hypotheses  and  tracks  are  expanded  to 
include  the  new  measurement  assignments  by  a 
measurement-oriented  tree  expansion  technique  described 
in  (I4j,  In  hypothesis  evaluation,  all  the  new 
hypotheses  are  evaluated  using  Equation  (3)  and  all  the 
new  tracks  are  assigned  with  the  updated  target  state 
distt ibutions  using  Equation  (10). 

As  is  well  known  the  number  of  possible 
hypotheses  and  that  of  possible  tracks  grow  very 
rapidly  as  the  data  sets  are  accumulated.  In  fact  we 
found  that  this  growth  is  worse  than  exponential  in 
most  cases.  Therefore,  as  in  any  implementation  of  any 
multi-hypothesis  system,  we  must  have  reasonably 
powerful  hypothesis  management  techniques  to  control 
the  number  of  hypotheses  and  tracks.  The  hypothesis 
management  techniques  incorporated  into  GTC  can  be 
categorized  as  follows: 

(a)  hypothesis  pruning 

(b)  hypothesis  combining 

(c)  windowing  and 

(d)  clustering 

Each  of  tne  above  categories  will  be  discussed  in  the 
next  section. 

Since  the  data  used  in  GTC  are  structured  and  the 
generally  large  memory  space  is  required,  the  choice  of 
programming  language  and  the  dynamical  memory  alloca¬ 
tion  are  crucial  issues.  The  first  version  of  GTC  was 
implemented  in  SAIL.  This  version  was  tested  in 
several  different  target/sensor  models,  from  very  sim¬ 
ple  ones  to  complicated  models  with  state-dependent 
detection  probabilities  and  target  classification. 
Currently,  implementations  in  tvo  other  versions  in  C 
and  LISP  are  under  way. 


6.  HYPOTHESIS  MANAGEMENT  TECHNIQUES 

Unfortunately  there  i*  no  hypothesis  maragement 
theory  applicable  to  general  oultitarget  tracking  prob¬ 
lems.  To  date  however  many  hypothesis  management 
techniques  have  been  proposed  and  tested.  In  this  sec¬ 
tion  we  will  disease  these  techniques  and  describe 
vhat  techniques  we  have  chosen  for  the  GTC  and  why.  We 
will  describe  these  techniques  according  to  the  four 
categories  of  hypothesis  management  described  in  the 
previous  section- 

(a)  Hypothesis  Pruning:  The  existing  pruning  tech¬ 
niques  are  further  categorised  as  (al)  t ixed-threshold 
pruning  in  which  insignificant  hypotheses  are  cut  and 
(a2)  f ixed-breadth  pruning  in  which  the  number 
(breadth)  of  hypotheses  is  limited.  In  [13]  fixed- 
threshold  pruning  (al)  was  proposed  in  which  any 
hypothesis  whose  probability  falls  below  a  predeter¬ 
mined  level  is  pruned.  In  this  method  however  no 
consideration  of  the  computing  resource  or  of  the 
external  conditions  is  made.  Typically,  the  uncer¬ 
tainty  of  the  origin  of  each  measurement  is  not 
resolved  in  the  first  few  data  sets  and  is  generally 
resolved  as  more  data  sets  are  accumulated.  In  such  a 
case,  we  may  want  to  keep  more  hypotheses  at  earlier 
stages  than  later  stages.  On  the  other  hand,  how  many 
hypotheses  can  be  stored  is  actually  determined  by  the 
available  computer  resource.  From  this  point  of  view 
fixed-breadth  pruning  (a2)  proposed  in  [14)  makes  more 
sense.  In  this  method  the  hypotheses  are  ordered  by 
their  probabilities  and,  for  a  given  fixed  breadth  N, 
at  most  N  best  hypotheses  are  kept.  In  an  extreme 
implementation  of  this  technique  N  is  one,  i.e.,  only 
the  best  hypothesis  is  kept.  Such  a  method  is  called 
zero-scan  algorithm  in  [13].  Unfortunately  this  method 
loses  its  rationale  when  we  use  any  form  of  clustering 
described  later  in  this  section,  since  some  clusters 
may  need  more  breadth  than  others  under  certain  condi¬ 
tions  and  there  is  no  intelligent  way  of  allocating  the 
resource  among  clusters. 

In  order  to  compensate  for  these  shortcomings,  we 
have  introduced  a  new  pruning  technique  called 
adapt ive- threshold/ adaptive-breadth  pruning  In  this 
method,  all  the  hypotheses  are  ordered  according  to 
their  probabilities  and  the  cumulative  probabilities 
from  the  best  hypothesis  are  calculated:  when  the  cumu¬ 
lative  probability  exceeds  a  predetermined  threshold, 
the  remaining  low-probability  hypotheses  are  discarded. 
One  can  view  the  fixed-threshold  pruning  a6  an 
adaptive-breadth-pruning  and  vice  versa.  This  new  prun¬ 
ing  method,  however,  possesses  overall  adaptivity  in 
which  the  breadth  in  each  cluster  is  adjusted  according 
to  the  complexity  of  the  measurement  data  in  the  clus¬ 
ter  snd  the  computer  resource  is  adequately  allocated 
among  clusters. 

(b)  Hypothesis  Combining:  When  the  measurement 
data  are  confusing,  we  may  have  a  large  number  of  simi¬ 
lar  hypotheses  with  small  probabilities.  In  such  a 
case,  unless  we  combine  similar  hypotheses,  either  of 
the  prunmg  methods  described  above  ould  tail  result¬ 
ing  in  the  loss  of  ‘'important.1'  hypotheses.  Since  the 
set  of  all  the  possible  hypotheses  is  a  partition  of  a 
probability  space  with  respect  to  the  uncertainty  of 
the  origin  of  each  measurement  in  the  past,  to  combine 
hypotheses  is  actually  to  coarsen  this  partition.  If 
the  partition  becomes  too  coarse,  the  performance  of 
the  tracker,  may  be  greatly  degraded.  Therefor*  a  com¬ 
mon  strategy  in  hypothesis  combining  is  to  combine  only 
"similar*’  hypotheses.  There  are  two  well  documented 
hypothesis  combining  methods:  lb))  tareet-state- 
oriented  combining  described  in  [13)  and  (b2) 
measurement-index-or iented  combining  originally  pro¬ 
posed  in  116).  In  the  first  method,  two  hypothes  s  are 
similar  and  hence  to  be  combined  if  they  have  an  ident¬ 
ical  number  of  tracks  and  each  track  in  a  hypothesis 
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ha*  a  unique  similar  track  in  the  other  hypothesis. 

Two  tracks  are  similar  if  their  state  distributions  are 
similar.  In  [17]  a  criterion  for  testing  the  similar¬ 
ity  of  two  gaussian  distributions  was  proposed.  In 
general  case*  however  the  distribution  similarity 
should  be  carefully  considered  based  on  the  particular 
target/sensor  models.  When  two  hypotheses  are  com¬ 
bined,  each  pair  of  siuilar  tracks  is  combined  and  the 
state  distributions  are  combined  with  the  hypothesis 
probabilities  as  weights. 

Heasurement-index-oriented  combining  (b2)  is  actu¬ 
ally  a  classic  method  proposed  almost  a  decade  ago.  In 
this  method,  for  a  given  M,  two  tracks  with  the  same 
measurement  assignment  in  the  most  recent  H  data  sets 
are  identified.  When  tracks  are  identified  in  this 
way,  we  may  have  sets  of  identical  hypotheses  which  can 
be  combined  naturally.  This  method  is  sometimes  called 
the  M-scan  or  deoth-M  algorithm.  This  scheme  may  not 
be  adequate  when  the  arrival  of  data  sets  is  irregular 
or  sensor  characteristics  vary  widely  from  sensor  to 
sensor.  Horeover,  when  two  tracks  are  combined,  there 
is  no  adequate  weighting.  For  this  reason  we  have 
chosen  the  target-state-oriented  combining  in  which  the 
determination  of  two  similar  tracks  and  the  actual 
track  combining  are  performed  by  a  user-provided  exter¬ 
nal  routine  outside  the  GTC. 


As  an  extreme  example  of  combining,  in  the  JPDA 
algorithm  described  in  [15]  all  the  hypotheses  are  com¬ 
bined  at  each  stage  so  that  only  one  hypothesis  is  kept 
for  the  next  st'ge.  It  is  clear,  however,  that  this  is 
possible  only  when  we  have  a  priori  probability-one 
hypothesis  with  a  or iori  tracks  and  there  is  no  newly 
detected  target. 


(c)  Windowing:  When  a  track  state  distribution  has 
a  reasonable  variance  and  measurement  error*  are  not 
exceptionally  large  with  respect  to  the  f ield-of-view 
of  a  sensor,  one  can  expect  the  track-measurement 
likelihood  defined  by  (6)  to  be  very  small  except  for  a 
limited  region.  Windowing  techniques  are  generally 
designed  to  set  an  appropriate  threshold  on  the  likeli¬ 
hood  function  so  that  its  value  outside  this  region  is 
considered  zero  rather  than  a  very  small  but  still 
positive  number,  thereby  eliminating  unnecessary 
hypothesis  expansion.  In  other  words,  windowing  is 
screening  of  data  to  determine  which  ones  w?  should  try 
to  associate  to  a  given  track.  For  this  reason,  win¬ 
dowing  is  often  called  data  validation  and,  for  each 
track,  the  set  of  measurements  in  which  the  likelihood 
function  is  not  zero  is  called  the  validation  region.. 
When  the  track  state  distribution  and  the  measuremei  t 
error  are  both  gaussian,  threshold  windowing  by  the 
gaussian  likelihood  functions  corresponds  exactly  to 
classical  X*-  or  extended  X^-test. 


One  also  may  view  windowing  as  immediate  pruning 
in  which  a  branch  is  cut  solely  based  on  one  likelihood 
function.  However,  since  hypothesis  eval  ation  cannot 
be  completed  until  all  the  hypotheses  are  expanded  and 
their  probabilities  are  obtained  by  normalization,  any 
intermediate  pruning  may  degrade  the  tracking  perfor¬ 
mance.  Therefore,  one  should  chooBe  adequate  windowing 
procedure  rather  by  carefully  examining  the  physical 
nature  ol  the  particular  target/sensor  models  used  in 
the  system.  For  this  reason,  GTC  relies  on  a  user- 
provided  external  routine  for  performing  adequate  win¬ 
dowing  and  simply  receives  the  zero  likelihood  when  a 
measurement  data  is  not  validated. 


(c)  Clustering:  Because  of  our  i.i.d.  assumptions 
made  in  Section  A,  when  the  validation  region  of  each 
track  is  not  exceptionally  large,  we  can  further  decom¬ 
pose  the  basic  equation  (3)  or  (11).  In  such  a  case, 
we  can  group  tracks  and  measurements  so  that  local 
hypotheses  on  such  a  group,  called  a  cluster .  may  be 
generated  and  evaluated  locally.  For  this 


decomposition  to  be  valid,  any  two  tracks  across  two 
different  clusters  should  not  intersect.  If  this  con¬ 
dition  is  satisfied,  the  set  of  global  hypotheses  can 
be  reconstructed  by  forming  all  the  possible  unions  of 
local  hypotheses.  Thus  clustering  techniques  are  in 
general  methods  for  maintaining  this  non-intersection 
condition.  An  algorithm  for  performing  such  a  task  was 
described  in  [13]  and  the  current  GTC  adopts  this  tech¬ 
nique. 


7.  COMPARISON  WITH  OTHER  ALGORITHMS 

The  most  significant  difference  of  our  basic  for¬ 
mulation  from  others  is  that,  for  each  data  set,  we  do 
not  assume  the  probability  of  detection  but  rather 
specify  a  detection  function  which  is  actually  the  con¬ 
ditional  probability  given  a  target  state,  i.e., 
state-depenuent  detection  probability.  In  many  cases, 
we  cannot  correctly  model  tensors  without  state- 
dependent  detection  probability.  For  example,  a  target 
may  be  out  of  a  tensor  f ield-cf-view  or  hidden  other¬ 
wise  in  an  unobservable  region,  if  we  cannot  have  a 
common  probability  of  detection  for  all  the  tracks  in  a 
data  set,  the  most  commonly  used  binomial  distribution 
on  the  number  of  detected  targets  (originally  used  in 
‘[13]  and  subsequently  used  in  many  other*  such  as  [6], 
[10],  etc.)  is  no  longer  adequate.  Instead  the  Bayc-  . 
sian  expansion  via  the  random  set  I-  of  detected  tar¬ 
gets  and  the  random  assignment  function  A  at  each  data 
set  k  must  be  employed. 

In  many  applications,  however,  the  detection  pro- 
oability  function  is  determined  only  by  a  sensor 
f ield-of-view  as  PD(x)  -  p^_.„  Wn(x)  ,  e.g„,  where 


rD(x)  -  J  g(y  -  Hx;  R)  dy, 

<17 

g(.;R)  it  the  zero-mean  mulri-diuntional  gaussian  den¬ 
sity  with  variance  R,  <y  is  the  f ield-of-view  and  y  - 
Hx  ♦  noict  is  the  measurement  equation  with  error  vari¬ 
ance  R.  In  such  a  case,  the  target-state-measurement 
transition  probability  density  becomes 


PM(ylx)  *  g(y  -  Hx;  R)  /  *B(x). 


on  4).  Therefore,  in  such  special  cases,  if  a  track 
has  £.  priori  state  distribution  which  is  gaussian  with 
mean  $  and  variance  P,  the  track-measurement  likelihood 
defined  by  (6)  becomes  Pj. g(y-Hx;HPHI+R) ■  Moreover 
i  he  track-no-measurement  likelihood,  I 


/« 


PD(x))g(x  -  x;  P)dx 


can  be  approximated  by  (1- p^)  i£  „x  it  well  iiside 
°J  and  HPH*+K  is  not  exceptionally  large.  Exactly 
these  two  terms  appeared  in  Reid's  algorithm  in  £  J.  3  ] . 


The  crucial  difference  of  our  algorithm  in  sAch 
gaussian  cases  is  the  treatment  of  likelihood  L(y^T) 
where  r  ia  formed  for  the  first  time.  In  Reid's 
rithm,  such  an  L(y.r)  is  constant  and  is  called  ne|u 
target  density  .  For  example,  when  we  observe  tar¬ 
gets  moving  at  almost  constant  velocities  in  a  one- 
dimensional  space  with  a  relatively  high  detection  pro¬ 
bability  P[j_av,  as  data  sets  are  accumulated  we  expect 
newly  detected  targets  to  appear  only  on  the  edges  of 
the  f ield-of-view.  Only  by  calculating  this  likelihood 
tt  t  function  of  measurement  value,  can  we  adequately 
incorporate  such  an  efiect  into  track  initiation.  In 
many  applications,  the  processing  of  severel  measure¬ 
ments  right  after  a  track  is  formed  tor  the  first  lime 
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it  very  iaportant  since  it  serves  as  a  multi-scan  tar¬ 
get  detector.  In  such  cases,  an  exact  calculation  of 
this  likelihood  oay  be  very  iaportant. 

As  shovn  in  [13!,  in  a  typical  situation,  we  oust 
fora  many  hypotheses  in  several  data  sets  after  a  track 
is  formed  for  the  first  time  and  these  hypotheses  even¬ 
tually  become  a  single  probability-one  hypothesis  by 
means  of  pruning  and  combining.  After  a  track  is  “con¬ 
firmed."  or  “initiated"  in  such  a  way,  the  GTC  algorithm 
behaves  in  a  way  very  similar  to  that  of  JPDA  algorithm 
described  in  [131  if  the  likelihood  of  receiving  sig¬ 
nals  from  newly  detected  targets  is  very  low  in  the 
validation  regions  of  the  track.  As  a  matter  of  fact, 
if  we  set  the  newly  detected  target  likelihood  func¬ 
tions  to  be  all  zero  and  start  GTC  algorithm  with  a 
single  a  priori  probability-one  hypothesis,  we  can 
reduce  our  algorithm  to  JPDA  with  gaussian  assumptions 
and  appropriate  approximations. 

Equ.  (11)  in  Section  5  gives  a  general  formula 
both  for  the  track-likelihood  approach  and  for  the 
batch-processing  approach.  An  algorithm  using  the 
former  approach  is  described  in  [14]  while  one  using 
Che  latter  is  described  in  [121,  both  with  gaussian 
models. 

8.  CONCLUSIONS 

A  system  for  the  tracking  and  bias sit icat ion  of 
multiple  tracks  by  multiple  sensors!,  called  GTC,  has 
been  described.  This  system  was  built  as  the  first 
implementation  of  the  general  Bayesian  multitarget 
tracking  formulation  which  the  authors  hsd  developed 
esrlier.  By  this  implementatio  ,  we  can  reasonably  han¬ 
dle  the  problems  of  state-dependent  detection  probabil¬ 
ity  and  target  initiation  processes.  Also,  within  the 
i.i.d.  assumptions,  fairly  general  .target/sensor  models 
can  be  handled.  As  seen  in  Section  4,  however,  non¬ 
linear  filtering  problem  is  alvays  a  major  subproblcm 
in  any  multitarget  tracking  problem.  3eyond  Kalman  or 
extended  Kalman  filters  or  totally 'discrete  estimation 
problems,  nonlinear  filtering  problems  are  very  diffi¬ 
cult  to  handle.  Thus  good  solutions  to  such  filtering 
problems  are  still  essential  to  a  successful  implemen¬ 
tation  when  non-gaussian  models  are  required.  Although 
the  problem  of  dependence  among  targets  has  been 
theoretically  solveu,  further  studj.es  are  necessary 
before  a  general  algorithm  capable' of  handling  such 
complicated  situations  can  be  implemented. 
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I.  Introduction 

— 'The  special  structure  of  surveillance  systems  has 
been  recognized  and  exploited  in  the  development  of 
relevant  mathematical  models  for  some  t ime .''frevgy-; -see 
( 1 ) ,  1 24) . .  These  models  provide  the  necessary  rela- 
tlonships  for  applying  probabilistic  techniques  to  sur¬ 
veillance  problems  and  therefore  determine  the  complex¬ 
ity  of  the  resulting  solutions.  Typically,  such  solu¬ 
tions  are  very  complex  and  require  unreasonably  large 
amounts  of  memory.  For  this  reason,  varicus  algorithms 
have  been  developed  which  restrict  the  amount  of  comp¬ 
lexity  at  the  possible  expense  of  system  performance. 
The  purpose  of  this  paper  is  to  formulate  a  mathemati¬ 
cal  framework  in  which  analytical  comparison  of  these 
algorithms  may  be  performed.  > _ _ 


attempt  to  reduce  the  computational  complexity  of  these 
calculations  are  then  considered  in  a  framework  which 
allows  them  to  be  compared.  The  basis  for  this  frame¬ 
work  is  the  view  that  each  algorithm  is  in  effect,  com- 
— “■  puting  approximations  to  the  desired  statistics. 


II.  Mathematical  Foundations 

A  precise  description  of  the  dynamic  evolution  of 
states  in  a  surveillance  system  is  given  by  the 
following  mathematical  model. 


Xfcfl  “  f<Yk)xk  +  Wk 

(1) 

yk  ”  H(Yk)xk  +  Vk 

C2) 

The  special  structure  of  surveillance  systems  may 
be  seen  by  examlng  two  Important  and  related  surveil¬ 
lance  problems;  target  tracking  and  identification.  In 
the  most  genetal  setting,  identification  may  include 
tasks  such  as  data  association,  data  type  identifica¬ 
tion,  target  type  identification  and  maneuver  detec¬ 
tion.  These  casks  are  "discrete"  types  of  tasks  in 
that  a  decision  amongst  a  finite  number  of  hypotheses 
must  be  made.  Tracking  problems  on  the  other  hand,  are 
"continuous”  in  nature  since  the  position/velocity  of 
each  target  may  take  on  any  of  a  contlnum  of  values. 

The  result  of  these  observations  leads  to  the  fact  that 
the  mathematical  models  which  characterize  the  unknowns 
in  a  surveillance  system  contain  both  discrete  and  con¬ 
tinuous  states.  Note  that  the  measurements  which  pro¬ 
vide  information  for  performing  tracking  and  identi¬ 
fication  tasks  may  be  discrete  or  continuous. 

Systems  which  require  such  a  "hybrid  state"  form¬ 
ulation  are  not  necessarily  unreasonably  complex.  How¬ 
ever,  in  addition  to  this  hybrid  structure,  we  must 
also  consider  Chat  these  problems  are  dynamic.  In  par¬ 
ticular,  the  discrete  states  which  characterize  uncer¬ 
tainty  in  identification  problems,  may  change  with 
time.  This  behavior  reflects,  for  examole  the  arbi¬ 
trary  ordering  of  radar  data,  the  scintillation  of 
clutter  returns,  and  the  execution  of  maneuvers  by  the 
target.  As  we  will  see,  the  complexity  of  certain 
solutions  to  surveillance  problems  is  directly  related 
to  the  dynamic  behavior  of  the  discrete  states. 


where  at  time  k,  the  continuous  state  of  the  system  is 
xk,  the  discrete  state  is  Yk,  the  observable  quantity 
Is  yk,  and  wfe  and  vk  are  white  noises  whose  covariance 
matrices,  Qg  and  Rk ,  in  general  may  also  depend  on  yk 
(l.e.,  0k_Q(k,Yk),  Rk-R(k.Yk)).  Note  that  the  case 
where  F  and  H  are  time  varying  and  a  control  input  is 
present  in  (1)  may  be  easily  Included  and  the  results 
that  follow  will  remain  unchanged. 


The  parameter,  yk,  is  called  the  status  parameter 
'.nd  takes  on  one  of  a  finite  number  of  values  in  a  set 
S|(  at  each  time  k.  We  model  the  hlerarchal  structure 
mentioned  previously  by  assuming  that  yk  is  a  Markov 
chain  with  known  one  step  transition  probabilities. 
That  is. 


L 

P(fk  -  si)  -  E  Hfj.pCYk-l 
J-l 


(3) 


where  Sk  -  <  s ; ,82, . . .»L f  and  IIij  -  p(yk*si  lYk-fSj  )  • 
The  problems  of  identification  and  tracking  in  surveil¬ 
lance  systems  reduces,  in  this  framework,  to  the  esti¬ 
mation  of  Y|t  and  xk  based  on  a  set  of  observations,  Ym 
•I  Jfi!l<  1  <  in  j.  As  is  the  case  with  all  dynamic 
estimation  problems,  we  can  consider  smoothing  (k<m), 
filtering  (k“m)  and  prediction  fk>m)  problems  depending 
on  the  goals  of  the  specific  surveillance  system.  In 
each  of  these  cases,  estimates  may  be  computed  if  cer¬ 
tain  sufficient  statistics  are  known. 


In  contrast  to  the  resultant  complexity  associated 
with  dynamic  hybrid  state  space  characterizations,  many 
problems  in  surveillance  can  be  simplified  by  recog¬ 
nizing  and  modelling  their  hierarchical  structure.  That 
is,  it  is  frequently  advisable  to  consider  a  subset  of 
hybrid  state  problems  in  which  the  discrete  state,  or 
states,  evolve  independently  in  time  and  the  continuous 
states  and  observations  are  explicitly  dependent  on  the 
dynamic  realization  of  the  discrete  variables. 

In  the  following  sections  we  will  review  a  w  1  '"ily 
used  mathematical  model  and  the  methods  by  which  suf¬ 
ficient  statistics  for  estimating  the  continuous  states 
(tracking)  and  for  making  decisions  about  the  discrete 
states  (Identification)  are  obtained.  Algorithms  which 


For  example.  If  estimates  of  xk  are  of  primary 
Importance,  then  we  desire  a  characterization  of  the 
conditional  probability  density  function  (pdf), 
p(xk|Ym).  Other  statistics  of  importance  to  identifi¬ 
cation  and  tracking  Include  p(Yk|Y“)  and  p(xk,Yk|Ym) 
and  p(yk|Y^).  For  simplicity,  in  all  that  follows  we 
will  concentrate  on  characterization  of  the  marginal 
pdfs  for  xo  and  yk. 

The  solution  to  the  problem  of  characterizing  the 
marginal  pdfs  is  obtained  by  a  direct  application  of 
the  partitioning  theorem  ( 3 ] . 

p(*k|Yk)  -  l  p(rk|Yk)  p(xk|Yk,rk)  (4) 

I* 
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where  rkj“  Yj:l<J<kj  .  This  result  is  very  useful  for 
two  important  reasons.  First,  the  conditional  pdf's, 
are  Gaussian  (under  the  assumption  that  x0 
is  Gaussian)  and  are  computable  from  Kalman  filter 
equations  matched  to  the  various  status  sequences  [41. 
Secondly,  the  status  sequence  probabilities  can  also 
be  computed  recursively.  In  particular,  these  equa¬ 
tions  separate  into  predict  and  update  operations  and 
are  given  by. 

Predict:  p(rk+l|Vk]  -  p(fk+l  |vic)p(rk|  Y*1)  (5) 

Update:  p(l*|Yk)  -  pf  w  lrk.Yk-’ ;W  I* | Y^U  (6) 

p(ykl*kl) 

Finally  we  have, 

P(Tk|Yk)  -  l  p(rk|Yk)  (7) 


What  the  partitioning  approach  has  done  is  allow 
us  to  characterize  the  desired  pdf.'s  by  a  finite, 
albeit  growing,  number  of  parameters.  In  effect,  it 
has  Increased  the  number  of  hypotheses  which  must  be 
considered  at  time  k  from  the  set  of  values  which  Yg 
may  realize,  to  the  set  of  values  which  F*1  may  realize. 
This  growing  number  of  hypotheses  with  time  presents 
the  fundamental  limitation  to  solving  surveillance 
problems  via  pdf  computation  and  provides  the  primary 
motivation  for  considering  suboptlmal  (i.e.,  approxi¬ 
mate)  algorithms. 

The  most  commonly  used  algorithms  in  practical 
surveillance  systems  can  all  be  viewed  as  methods  of 
approximating  the  desired  pdf.'s  which  are  optimally 
computed  from  (4)  through  (7).  These  approximations 
involve  some  form  of  hypothesis  reduction  technique  in 
which  the  number  of  status  sequences,  rk,  which  must  be 
considered  at  each  time,  is  kept  under  some  limit. 

Table  2-1  lists  the  most  common  types  of  hypothesis 
reduction  techniques  and  is  organized  to  show  that 
these  techniques  can  be  classified  as  either  pruning 
algorithms  or  merging  algorithms. 

1.  Pruning  Algorithms 

a.  Screening  (Gating) 

b.  Decision  Directed  (Hypothesis  Testing) 

c.  Maximum  Likelihood 

2.  Merging  Algorithms 

a.  Local  Moment  Invariance 

(i)  fixed  hypothesis  selection 
(ii)  adaptive  hypothesis  selection 

b.  Best  Local  Gaussian  Representation 


Table  2-1.  List  of  Hypothesis  Reduction  Techniques 


III.  Comparison  of  Hypothesis  Reduction  Techniques 

Comparing  the  various  hypothesis  techniques  i  i  a 
particular  surveillance  system  usually  Involves  exten¬ 
sive  Monte  Carlo  simulation  and  can  be  quite  costly. 

In  addition,  such  simulation  methods  provide  little  in¬ 
sight  into  the  importance  of  various  parameters  of  the 
algorithms.  In  this  section,  a  framework  and  some  sam¬ 
ple  results  are  developed  which  illustrate  the  advan¬ 
tage  of  using  analytical  methods  in  the  comparison  and 
evaluation  of  survel  Hence  algorithms. 

The  hypothesis  reduction  techniques  of  Table  2-1 
are  used  In  surveillance  algorithms  In  a  variety  of 


ways  [S[.  In  order  to  compare  the  effectiveness  of 
each  of  these  techniques  we  will  consider  a  class  of 
fixed  structure  algorithms  which  differ  only  in  the 
type  of  hypothesis  reduction  techniques  that  are  used. 
The  algorithm  is  executed  as  follows: 

1.  Compute  the  desired  pdf.  [e.g.,  p(x|JYk)]  up  to 
some  maximum  time,  k»N,  at  which  point  some  form  of  hy¬ 
pothesis  reduction  technique  is  necessary  to  reduce  the 
computational  burden. 

2.  Apply  a  single  hypothesis  reduction  technique  and 
interpret  the  remaining  hypotheses  as  an  approximate 
pdf.  (e.g.,  if  at  time  k-N,  there  are  2N  hypotheses 
corresponding  to  each  rN  sequence  when  S»j  0,1  {,  then 
the  actual  pdf  p(x)c|Yk)  is  a  weighted  sum  of  2^  Gaus¬ 
sian  densities;  Eq.  (4).  If  we  prune  half  of  the  hy¬ 
potheses  and  renormalize  their  posterior  pre  abilities, 
then  the  pdf  approximation  is  a  weighted  sum  of  2N_1 
Gauss Ians). 

3.  Compute  an  approximate  pdf  at  time  k-N+1  by 
applying  the  recursive  pdf  equations  to  the  remaining 
hypotheses  as  if  they  represented  the  actual  pdf. 

4.  Repeat  Step  2  and  Step  3. 

Figure  1  shows  the  optimal  algorithm  and  the  class 
approximate  algorithms  described  above  in  block  diagram 
form.  The  predict  and  update  equations  of  the  Kalman 
filters  associated  with  the  computation  of  p(x^|rk,Y^) 
and  the  predict  and  update  equations  given  by  Eqs.  (5) 
and  (6)  are  shown  explicitly  at  each  time  step.  The 
hypothesis  reduction  technique  is  contained  in  the 
approximation  block  and  is  executed  at  each  discrete 
time  k. 

As  the  figure  suggests,  the  optimal  estimates  (of 
xk  or  Tk  °K  both)  can  be  obtained  only  when  the  actual 
pdf  is  available.  The  class  of  algorithms  described 
above  is  capable  of  providing  suboprlmal  estimates 
based  on  approximate  pdfs  which,  apparently  become  less 
accurate  with  time.  The  comparison  of  various  hypothe¬ 
ses  reduction  techniques  is  now  accomplished  by  quanti¬ 
fying  the  accuracy  of  the  pdf  approximations  which 
result  from  specific  techniques  and  comparing  the 
results. 

The  class  of  probability  density  functions  with 
which  we  are  dealing  constitutes  an  abstract  vector 
space.  The  predict,  update  and  approximation  algo¬ 
rithms  can  be  viewed  ns  operations  which  map  a  pdf  at 
time  k,  p(k),  into  pp(k),  Pu(k) >Aand  p(k)  respectively. 
The  accuracy  with  which  the  pdf  p(k)  approximates  p(k) 
can  be  measured  by  considering  a  vector  distance  mea¬ 
sure,  D($(k),p(k)j  defined  on  the  vector  space  of 
pdf.'s.  Many  such  measures  are  available  and  a  few  are 
defined  below. 

The  L2  norm  has  many  useful  properties  and  is 
defined  by; 


b2(p,pl  -  /  (p(x)-p(x))2dx  (8) 

X 

where  X  is  the  state  space  for  Che  random  variable,  x. 
Note  that  both  marginal  pdf.'s  ,  p(xjt|Yk)  and  p(Ykl*k) 
as  well  as  the  Joint  pdf  p(x^,Yp|Yk)  can  be  considered 
in  (8)  by  interpreting  the  Integral  in  the  Stiltjes 
sense. 

The  Bhattacharrya  distance  is  defined  as  the 
negative  log  of  the  B-coeff iclent ,  p,  which  is  defined 
by. 

Plp.pl  *  f  [p(x)p(x))l/2  dx  (9) 
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Figure  1.  Visualization  of  PDF  Computations 


Another  useful  measure  is  the  Kolmogorov  distance  or  Lj 
norm  which  is  defined  by. 


K[p,pl  -  2.  /  |p(x)-p(x)|  dx  (10) 

2 

Ultimately,  we  would  like  to  oe  able  to  choose  one 
of  these  measures  and  solve  for  the  distance  between 
$(x)  and  p(x)  at  each  time  k.  This  distance  must, 
clearly,  be  a  function  of  the  mathematical  model  of 
Eqs.  (1)  and  (2)  anj  the  specific  approximation  which 
Is  used.  It  Is  assumed  that  hypothesis  reduction  tech¬ 
niques  which  result  in  better  pdf  approximations  (smal¬ 
ler  distance  measures)  should  result  in  better  surveil¬ 
lance  system  performance. 

As  an  example  of  the  types  of  results  which  are 
useful,  we  now  consider  the  K-distance  as  a  measure  of 
performance  and  state  thr"e  theorems  relating  the  mea¬ 
sure  K[$(k),  p(k) ]  to  K[p(k+1),  p(k+l)J.  The  proofs  cf 
these  theorems  can  be  found  In  reference  [6],  (See 
Figure  1  for  definitions). 

Theorem  1 . 

Given  K[p(k),  p(k)J  •  4,  then  after  the  prediction  op¬ 
eration. 


K[Pp(k),pp(k)J  <  6 

Theorem  2. 

Given  K[pp(k) ,pp(k) )  •  4,  then  after  an  update 
operation, 

Klpu(k) ,pu(k) ]  <  24 

Finally,  since  K  Is  a  valid  norm  (it  obeys  the  triangle 
Inequality)  we  can  state  our  last  theorem. 

Theorem  3. 

Let  the  K-distance  across  the  approximation  operation 
at  time  k  be  denoted  by  4k.  Then 

K[p(k+l),p(k+l)|  <  24k  +  4k+1 


The  results  above  suggest  that  the  distance  across  the 
approximation  segment  are  the  fundamental  determinants 
of  the  performance  capabilities  of  a  surveillance  algo¬ 
rithm  which  utilizes  a  specific  hypothesis  reduction 
technique.  In  the  next  section,  two  widely  used  re¬ 
duction  techniques  are  compared  analytically  and  by 
simulation. 

IV,  Comparison  of  Maximum  Likelihood  Pruning 

and  Local  Moment  Invariance  Merging  Algorithms 

As  we  have  stressed  before,  hypothesis  reduction 
techniques  can  be  viewed  as  pdf  approximation  methods. 
As  a  simple  example  of  this  concept,  consider  a  pdf  for 
some  random  variable,  x,  say, 

F(x)  -  pjf  j ( x)  +  p2f2(x)  (11 

where  P1+P2  “  1,  f*(x)  are  normal  densities  with  means 
mi,  and  variances  0£  respectively.  Thus,  x  might 
represent  an  element  of  the  continuous  random  vector  xk 
In  (1)  and  (2),  and  F(x)  may  represent  an  approximation 
to  the  posterior  pdf  which.  In  general.  Is  a  weighted 
sum  of  many  Gauss Ians.  Two  approximations  to  F(x)  are 
defined  below; 

Fp(x)  -  f  arg  max  pi  (x)  . . -  02) 

2  2  2  2  2 

Fm(x)  “  Nx  t»“Pimi  +  P2B>2  ;  pi(ol+ml)+P2^«2+n2J“0  I 

03) 

where  the  notation  Nx(m;o2]  denotes  a  Gaussian  density. 
In  x,  with  mean  m,  and  variance  o^. 

The  function  F-,(x)  in  (12)  is  the  pdf  approxima¬ 
tion  generated  by  a‘  Maximum  Likelihood  Pruning  rule  for 
hypothesis  reduction.  That  Is,  if  we  are  considering 
two  hypotheses ,  each  of  which  give  rise  to  a  Gaussian 
density  for  x  and  their  corresponding  posterior  proba¬ 
bilities  p^,  then  the  pruning  rule  simply  drops  the 
parameters  associated  with  the  smallest  posterior  prob¬ 
ability. 

On  the  other  hand,  the  function  Fa(x),  Is  the  pdf 
approximation  associated  with  a  certain  type  of 
hypothesis  merging  algorithm.  Again,  1*  we  view  pt  as 
a  posterior  hypothesis  probability  and  fj(x)  as  an  hy¬ 
pothesis  conditioned  pdf  of  x,  then  the  merging  rule 
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.laces  the  two  hypotheses  with  a  single  one.  The 
lgle  hypothesis  conditioned  pdf  is  Caussian  with  the 
-st  two  moments  being  equal  to  the  first  two  moments 
the  weighted  sum  of  two  Gaussians  defined  by  (11). 
mce  the  name,  local  moment  invariant  merging). 

Now,  as  a  result  of  the  reasoning  in  the  last  sec- 
m  we  can  compare  these  two  hypothesis  reduction 
:hnlques  by  examining  the  behavior  of  the  K-dlstance 
a  function  of  the  specific  densities  involved.  Fig. 
I  shovs  the  numerically  computed  value  of  K[Fp,F]  and 
'b,F]  when  oj  «  o2  -  1,  p^  -  -  0.5,  m^  ■  0,  and 

varied.  It  is  quite  clear  that,  for  this  particular 
:  of  pdf.'s,  neither  pruning  nor  merging  are  superior 
thods  in  all  circumstances.  Pruning  seems  to  be  a 
oericr  approximation  when  m;  is  large,  while  merging 
far  superior,  in  terms  of  K-dlstance,  when  m 2  is 
all.  Thus,  in  practical  surveillance  systems,  it  is 
portant  to  consider  the  difference  between  the  means 
pdf.'s  corresponding  to  two  hypotheses  before  decid- 
g  whether  to  merge  the  two  hypotheses  or  prune  the 
ast  likely  one. 


0  >  4  t  *  10  12  U  16  18  20 


/* 

»■ l«l> 

F(x)  -  1/2  [f i(x)  +  f2(x)I 
fj(x)  •  Nx(0; 1) 
f2(x)  -  Nx^.l) 

Fp(x)  «  f 1 (x)  or  f2(x) 

Fm(x)  -  1 Sm|/4} 


Figure  2 

K  -  Distance  Between  Actual  and  Approx.  PDF.s 


COMTAXtSON  OF  ALGORITHMS  M  •  2 


Figure  3.  Comparison  of  Pruning  (PRN)  and  Merging 

(MXC)  Hypothesis  Reduction  Techniques  With 
Max  Number  of  Hypothesis  of  3N,(N-2  and 
N-3) 


smaller  than  that  obtained  by  pruning.  Furthermore, 
the  pruning  algorithm  only  performs  about  as  well  as  a 
Kalman  filter  (KF)  based  on  the  constant  velocity 
straight  line  motion  model  with  an  artificially  in¬ 
creased  design  process  noise  covariance.  However,  when 
N-3,  so  that  3^-27  hypotheses  are  retained,  then  it  is 
the  pruning  algorithm  whose  RMS  estimation  error  is 
lower  following  the  onset  of  the  maneuver. 


Similar  affects  are  seen  in  the  performance  of 
fiese  two  algorithms  in  a  realistic  scenario.  Fig.  3 
hows  the  results  of  a  detailed  simulation  of  the  two 
ypothesls  reduction  techniques  (pruning  and  merging) 
n  a  maneuvering  target  detection/ estimation  problem, 
he  general  algorithm  of  section  III  using  these  lwc 
peeiflc  reduction  methods  was  applied  to  the  problem 
f  estimating  the  position  of  a  target  whose  planar 
ynamics  could  be  described  by  one  of  three  possible 
lnear  difference  equations.  Using  noisv  measurements 
f  position,  each  estimator  was  simulated  over  an 
nltlal  time  interval  where  the  target  moves  in  a 
tralght  line  with  constant  velocity  followed  at 
ime  k-30,  by  either  a  left  or  a  right  turn  of  constant 
ngular  velocity.  The  performance  of  each  algorithm  is 
ssessed  by  examining  the  RMS  position  estimation  error 
s.  time.  (More  details  of  the  algorithms  can  be  found 
n  16].) 

When  the  actual  pdf  is  computed  up  to  time  k-N-2, 
ind  subsequent  approximations  keep  the  number  of  hypo- 
.hesee  reduced  to  3N-9,  we  see  that  after  the  c -.set  of 
»  maneuver  at  k-30,  the  merging  technique  produces  a 
•osltlon  estimate  whose  RMS  error  is  significantly 


Thus,  we  have  seen  analytically  by  comparing  K- 
dlstances,  and  experimentally  by  simulating  specific 
algorithms,  that  neither  maximum  likelihood  pruning  nor 
local  moment  Invariance  merging  result  in  superior  sur¬ 
veillance  algorithms  in  every  Instance. 


I  V.  Conclusions 


By  viewing  the i hypothesis  reduction  techniques 
which  appear  in  many  surveillance  algorithms  as  pdf 
approximation  methods  we  have  developed  an  analytical 
framework  in  which  these  algorithms  may  be  compared. 

It  was  concluded  that  the  quality  of  any  surveillance 
algorithm  can  be  measured  by  the  distance  between  the 
pdf  approximations  before  and  after  the  hypothesis  re¬ 
duction  technique  is {implemented.  By  examining  a  sim¬ 
ple  example  of  two  commonly  used  reduction  techniques, 
we  were  able  to  show  that  neither  technique  is  always 
superior  and  could  point  to  some  of  the  parameters 
which  might  Influence  a  decision  about  which 
technique  to  employ.  In  addition,  simulation  of  the 
two  algorithms  in  a  planer  maneuvering  target  tracking 
problem  exhibited  the  same  affects  that  were  pointed 
"ait  by  the  simple  analytical  results. 


Besides  the  results  described  above,  a  number  of 
important  issues  can  be  addressed  within  the  framework 
that  has  been  described.  First,  since  the  distance 
measures  described  in  Section  III  are,  typically,  dif¬ 
ficult  to  compute,  it  is  advantageous  to  consider  com¬ 
putable  bounds  on  these  distances.  Some  work  in  this 
area  has  been  described  in  [6]  and  [7]  although  these 
bounds  can  be  very  loose  [6].  Better  (tighter)  bounds 
are  necessary  if  accurate  comparisons  are  to  be  made 
and/or  if  these  bounds  are  used  in  adaptive  algorithms. 
Another  issue  which  requires  some  attention  is  the 
relationship  of  distance  measures  to  more  relevent  fig¬ 
ures  of  merit  like  RMS  estimation  error.  Finally, 
since  we  are  comparing  surveillance  algorithms  by  exam- 
ing  their  relationship  to  the  optimal  algorithm  which 
would  compute  the  desired  pdf's  exactly,  the  issue  of 
how  well  this  optimal  algorithm  can  perform  is  of  fund¬ 
amental  interest.  Ultimately,  there  is  always  a 
fundamental  limit  on  the  quality  of  any  surveillance 
syscem.  This  limit  is  a  function  of  the  parameters 
which  describe  the  underlying  phenomena  (e.g.,  signal 
to  noise  ratios,  likelihood  of  discrete  state  changes, 
continuous  state  observability,  etc.)  and  the  perform¬ 
ance  of  the  optimal  algorithm  is  one  means  of  assessing 
the  important  parameters  of  these  systems. 
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ABSTRACT 

'The  INtelligent  Correlation  Agent  (INCA)  project,-- - 
ponncred  by-OARPA,  if  directed  at  the  development  and 
lemonstration  of  advanced  technologies  for  supporting 
,ata  fusion.  A  broad  spectrum  of  technologies,  includ- 
ng  Artificial  Intelligence,  emc-ging  personal  worksta- 
.ions,  and  statistical  decision  theory,  have  been 
examined  and  used  in  this  project.  Central  to  recent 
tork  has  been  the  use  of  symbolic  c'-mp-to  s,  using 
wject-oriented/messagepassing  techniques.  The  result 
las  been  an  environment  which  elevates  the  level  at 
ihich  developers  and  analysts  can  explore  fusion 
:echniques.  Powerful  graphics  interaction  has  been  so 
.uccessful  that  the  current  INCA  system  has  been  used 
is  a  prototype  workstation  for  operational  data  fusion 
.ys terns.  An  incremental  development  methodology,  using 
eedback  from  experienced  analysts,  is  being  employed 
n  the  system  development.  This  paper  discusses 
ieveral  of  the  key  aspects  of  the  INCA  system.  . 

\ 

INTRODUCTION 

The  INtelligent  Correlation  Agent  (INCA)  project, 
sponsored  by  DARPA,  is  directed  at  the  development  and 
lemonstration  of  advanced  technologies  for  supporting 
saotical  data  fusion.  A  broad  spectrum  of  technolo- 
jies,  including  Artificial  Intelligence,  emerging 
lersonal  workstations,  and  statistical  decision  theory, 
lave  been  examined  and  used  In  this  project.  Central 
:o  recent  work  has  been  the  use  of  symbolic  computers, 
ising  objectorierted/message-passing  techniques.  The 
•esult  has  bean  an  environment  which  elevates  the  level 
»t  which  developers  and  analysts  can  explore  fusion 
nethods.  Powerful  graphics  interaction  has  been  so 
successful  that  the  current  INCA  system  has  been  used 
is  a  prototype  workstation  for  operational  data  fusion 
systems.  An  incremental  development  methodology,  using 
feedback  from  the  experienced  analysts,  is  being 
employed  in  the  system  development.  This  paper  dis¬ 
misses  several  of  the  key  aspects  of  the  INCA  system. 

The  first  section  of  this  pap^r  describes  some  of 
the  research  goals  that  initially  motivated  the  work. 
Then  a  section  describing  the  novel  software/hardware 
aspects  of  this  project  is  presented.  In  the  third 
section,  we  focus  on  the  current  machine  functionality 
in  both  the  algorithmic  development  and  workstation 
prototyping.  Next,  a  discussion  of  how  expert  assis¬ 
tance  is  being  integrated  into  INCA  is  presented. 
Finally,  future  plans  for  the  project  are  discussed. 

INITIAL  MOTIVATION  FOR  THE  INCA  WORK 

Our  project  focus  is  the  generation  of  enhanced 
targeting  information  by  integrating  data  from  multiple 
sources.  In  particular,  we  wish  to  address  the  problem 
of  creating  ocean  surveillance  scenes  where  the  data 
rate  per  platform  is  often  quite  low  and  t^'c  content  of 
the  data  does  not  result  in  unique  identification  of 
the  platform.  As  a  consequence,  there  exists  consider¬ 


able  uncertainty  in  the  association  of  reports  to 
specific  platforms,  resulting  in  ambiguity  in  the  ocean 
surveillance  scene  (i.e.,  the  set  of  platform  tracks 
with  uniquely  associated  reports).  Of  course,  sensor 
systems  are  continually  supplying  new  data  and  this 
must  be  incorporated  into  the  evolving  scenario. 

The  basic  approach  taken  to  resolve  the  ambiguity 
problem  is  shown  in  Figure  1.  At  the  top  of  the  tree 
is  the  set  of  initial  data  from  which  a  set  of  candi¬ 
date  scenes  are  generated.  Since  there  is  confusion  in 
the  data,  several  scenes  may  be  viable  alternatives. 
The  next  partition  of  data  (for  example,  the  newest  set 
of  data  to  arrive)  is  then  used  to  extend  some  number 
of  scenes  from  the  previous  cycle.  The  cbvlous  combin¬ 
atorial  explosion  must  somehow  be  controlled. 


Figure  1.  INCA:  INtelligent  Correlation  Agent 
Explosive  Growth  of  Alternative  Scenes 

An  immediate  question  to  addresc  is  what  type  of 
fusion  process  is  occurring  at  each  node.  In  general, 
there  are  a  large  number  of  fusion  techniques  avail¬ 
able.  However,  it  is  often  necessary  to  guide  any  such 
algorithms  to  a  successful  or  computationally  accept¬ 
able  solution.  At  the  beginning  of  the  project  th» 
knowledge  of  when  to  employ,  or  how  to  tune,  an  algo¬ 
rithm  was  not  fully  understood.  Thus,  an  initial  goal 
for  INCA  was  the  building  of  a  system  in  which  a  high 
degree  of  user  interaction  would  allow  analysts  to 
develop  "rules"  about  when  and  how  to  apply  different 
correlation  methodologies. 

In  real-world  data  processing,  human  interaction 
with  most  processing  operations  must  be  limited  to 
produce  results  in  a  timely  fashion.  Hence,  the 
automation  of  selection  and  control  of  fusion  node 
processing  was  needed.  In  fact,  because  of  the  branch¬ 
ing  and  multiplicity  of  algorithms  to  be  supported,  it 
was  clear  that  operating  the  system  would  require 
significant  expertise.  Thus,  a  secondary  thrust  of  the 
project  was  the  incorporation  of  an  “expert  system"  to 
run  or  provide  assistance  to  future  users. 

The  project  also  capitalizing  on  other  artificial 
intelligence  techniques  to  adoress  the  search  problems 
intrinsic  in  the  tree  formulation  of  multiple  scene 
generation. 
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INCA  HARDWARE  AND  SOFTWARE 


Researchers  in  data  fusion  are  well  aware  of  the 
difficulty  inherent  in  data  fusion  processing  and  the 
complexity  of  software  development.  The  use  of  Fortran 
code  onarating  on  large,  time-shared  computers  is  the 
traditional  development  tool  for  testing  new  techniques 
and  methodologies.  The  goals  for  this  project  lead  us 
to  consider  new  approaches  to  the  correlation  software 
development.  Initial  work  on  INCA  was  undertaken  using 
Interlisp  on  Dec-10  systems.  The  power  and  superior 
programming  environment  was  recognized  immediately  as  a 
tool  to  get  leverage  over  problem  complexity.  In 
addition  to  the  obvious  advantages  of  an  interactive 
language  with  dynamic  storage  allocation,  our  motiva¬ 
tion  for  a  Lisp-like  language  was  influenced  by  the 
need  to  include  limited  AI  aspects  in  the  project. 
Additionally,  the  use  of  list  structures  fo>"  track  and 
hypothesis  representation  seemed  particularly  appeal¬ 
ing. 

Our  initial  Interlisp  code  (1981),  which  had  very 
weak  correlation  algorithm-.,  was  used  in  a  real-time 
experiment  as  an  off-line,  data  quality  assurance 
system. 

The  success  of  the  initial  code  reinforced  our  belief 
that  Lisp-like  languages  were  especially  suitable  for 
exploratory  rerearch.  However,  the  need  for  very 
tight,  man-machine  coupling  was  not  well  supported  by 
the  available  mainframe  graphics.  At  about  the  time  of 
the  completion  of  the  experiment,  a  new  type  of 
hardware  was  being  brought  to  realization.  Systems 
based  on  ARPA/ONR  research  in  artificial  Intelligence. 
Generically,  these  machines  are  referred  to  as  Lisp 
machines,  since  they  incorporate  special  hardware  to 
support  the  Lisp  language.  Figure  2  presents  some  of 
the  key  aspects  of  Lisp  machine  systems  currently 
available.  Our  initial  hardware,  A  Symbolics  LM-2,  was 
delivered  in  December  of  1931.  As  significant  than  the 
raw  power,  address  space,  and  superior  mar.-machlne 
Interface  of  these  machines,  however,  is  the  powerful 
data  abstraction  constructs  of  the  language  which 
constitute  the  Flavor  system  that  is  provided  as  an 
extension  of  the  Lisp  language.  The  Flavor  system 
allows  programmers  to  develop  structures  that  are 
closely  related  to  the  physical  or  theoretical  objects 
that  the  researcher  is  trying  to  model.  Thus,  objects 
such  as  "reports"  or  “sensors"  are  manipulatable 
entities  in  the  system.  For  example,  a  track  "object" 
may  store  its  reports  as  a  list  of  the  form: 

(  REFORT-OBJECT- 1  REFORT-OBJECT-2  ...); 

adding  a  report  to  a  track  is  simply  accomplished  by 
the  statement 


spec:*:  architecture  to  support 
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Figure  2.  Advanced  Personal  Computer  Hardware 

Figure  3  shows  the  main  window  of  the  INCA  system. 
Virtually  all  user  interaction  is  via  a  mouse  and 
mousesensitive  icons  and  menus  on  the  screen.  The 
large  gridded  window  at  the  left  is  for  geographic 
displays;  it  can  be  divided  into  as  many  as  four 
separate  geographic  display  panes.  The  rectangular 
boxed  stricture  below  the  INCA  logo  is  called  a 
blackboara  and  is  central  to  the  user  interface.  The 
blackboard  is  divided  into  levels  (rows),  each  of  which 
represents  a  step  in  the  correlation  process.  Each 
level  of  the  blackboard  is  divided  horizontally  into  a 
set  of  bins;  the  logical  definition  of  the  bin  levels 
is  shown  in  Table  1.  Each  of  the  bins  in  a  level 
contains  a  definition  of  how  Its  data  were  generated 
and  also  contains  the  result  of  executing  that  defini¬ 
tion.  We  have  intentionally  separated  logically 
distinct  aspects  of  the  processing  in  order  to  give  the 
operator/analyst  the  maximum  visibility  into  and 
control  over  the  processing.  Immediately  under  the 
blackooard  is  a  set  of  menus  which  present  many  of  the 
top-level  functions  to  the  user.  These  Include  the 
functions  to  control  the  display,  to  define  and  execute 
a  bin,  and  a  set  of  manual  track  and  report  editing 
coirmands. 

Table  1.  Steps  in  Recursive  Sane  Generation 


ACTIVITY 


LEVEL 


Sci ECT  PARTITION  OF  NEW  DATA  R 


(  SEND  TRACK  1 : ADD-REPORT  REPORT  ). 

The  Flavor  system  is  a  type  of  object-oriented  pro¬ 
graming  approach,  distinguished  by  the  use  of  "message 
passing"  facilities.  Message  passing  has  the  very 
powerful  feature  that  deamons  and  triggers  can  be 
attached  to  the  abstract  data  types.  For  exauple,  a 
deamon  can  be  added  to  the  : ADD-REPORT  message  so  that 
the  track  state  is  automatically  recalculated.  These 
capabilities  collectively  support  very  flexible  imple¬ 
mentation  schemes. 

THE  CURRENT  INCA  SYSTEM 

This  section  will  present  the  tools  that  are 
currently  available  to  the  analyst  for  develooing  an 
ocean  surveillance  scene  and  for  exploring  new  correla¬ 
tion  methodologies.  This  information  will  be  presentr  i 
by  showing  several  screen  images  from  the  INCA  sys*“'  . 


GENERATE  FEASIBLE  EXTENSIONS 

TO  THE  TRACKS  IN  THE  OLD  SCENE(S)  FT 

USING  NEW  REPORTS 

GENERATl  A  SET  OF  ALTERNATE  WORLD  FS 

VIEWS,  FEASIBLE  SCENES 

SELECT  A  SET  OF  THE  FEASIBLE  SCENES  S 

TO  BE  EXTENDED  ON  NEXT  ITERATION 

Figure  4  shows  how  the  user  interacts  with  INCA  in 
order  to  generate  a  partition  of  data  on  which  he 
wishas  to  do  some  processing.  The  user  had  moused  the 
DEFINE  AND  EXECUTE  menu  item  followed  by  a  selection  of 
tne  first  bin  at  the  R  level.  A  pop-up  menu  is 
presented  that  allows  him  to  make  the  equivalent  of  a 
uata  base  query.  The  result  of  this  quary  is  displayed 
in  Figure  4.  The  user  can  interrogate  the  contacts  by 
additional  mouse  interactions  as  shown  in  Figure  5. 
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Figure  3.  INCA  Main  Window 
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Figure  4.  User  Interaction  In  Generating 
Report  Partitions 


Figure  5.  Example  of  Pop-Up  Menu  for  Access 
to  Additional  Conmands 

Figure  6  shows  the  menu  of  choices  that  the  user 
has  at  the  feasible  track  (FT)  construction  stage.  In 
this  stage,  many  feasible  tracks  are  generated,  each  of 
which  entail  the  feasible  association  of  reports  to 
tracks,  along  with  scores  measuring  the  quality  of 
association.  In  addition  to  specifying  the  bins  that 
he  wishes  to  serve  as  data  sources,  the  user  can 
specify  one  of  several  correlation  techniques.  The 
methods  currently  available  Include: 

1)  A  deterministic,  algorithm  based  on  kinematic 
feasibility  and  parametric  gates. 

2)  A  Bayesian  scheme  that  considers  both  kine¬ 
matic  and  attribute  associations.  The 
technique  Is  based  on  the  work  of  Bowman  and 
Murphy  [1]. 

3)  An  approximate  Bayesian  scheme  which  makes 
heuristic  approximations  to  the  complete 
Bayesian  scorer. 

In  the  first  o*  these  techniques,  each  repo-t  is 
associated  to  at  most  one  track.  In  the  latter  two, 
reports  may  be  associated  to  more  than  one  track, 
Kalmanfllterlng  Is  used,  and  scores  are  based  on  filter 
residuals.  Also  note  that  this  menu  contains  several 
"control  parameters",  such  as  report  parameter  gate 
values,  and  a  set  of  heuristic  tree  search  parameter 
values,  which  control  the  combinatoric  branching  during 
the  track  generation  process.  [2]  Figure  7  shows  the 
output  of  this  process  for  a  Bayesian  scoring  scheme. 

The  final  step  Is  the  generation  of  a  consistent 
scene  from  the  candidate  tracks  o.  the  FT  level,  which 
can  be  accomplished  by  uniquely  associating  reports  to 
tracks.  This  Is  done  at  the  FS  level  where  a  global 
optimization  of  the  score  Is  attempted.  Typically, 
computational  considerations  preclude  an  exhaustive 
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Figure  6.  Chords  for  Feasible  Track  Construction 


search,  and  we  apply  heuristic  search  methoas  as  well. 
Figure  7  shows  two  feasible  scenes  scores  that  have 
been  generated  with  a  heuristic  algorithm.  The  user 
would  now  select  one  or  both  of  these  to  propagate  with 
the  next  set  of  data. 

The  blackboard  serves  as  an  Interface  to  the  INCA 
data  base  wnere  raw  report.  Intermediate,  and  final 
products  are  stored.  He  may  elect  to  process  a  partic¬ 
ular  set  of  data  in  several  ways  and  the  bins  are  used 
to  store  these  results.  There  is  no  intrinsic  hardware 
or  software  limitation  on  the  number  of  scenes  or 
number  of  intermediate  results  he  can  maintain.  (The 
current  graphical  blackboard  Is  limited  to  six  scenes, 
but  will  be  modified  to  display  a  structure  that  better 
maps  to  the  tree  that  the  user  is  building.)  In 
addition  to  the  algorithmic  techniques  available  to  the 
user,  a  large  number  of  manual  and  g*aph’c  tools  are 
available.  Figure  8  shows  one  of  the  plotting  options 
available  for  examining  and  creating  data  partitions 
via  parametric  value  selection. 

MOTIVATION  AND  METHODOLOGY  FOR  INTELLIGENT  ASSISTANCE 

The  current  INCA  system  allows  the  user  to  gener¬ 
ate  tactical  pictures  using  a  wide  variety  of  tools  and 
techniques.  We  are  confident  that  this  allows  sophis¬ 
ticated  users  to  understand  better  the  Impact  of 
process  improvements  and  tu  determine  the  best  pro¬ 
cessing  strategy  for  particular  situations.  Our 
assumption  was  ’'hat  the  analysts  would  apply  very 
specific  expertise  relevant  to  each  of  the  methods  as 
they  evaluated  process  performance  and  tuned  the 
algorithms.  Since  the  results  of  this  tuning  was  to  be 
made  available  to  users.  It  was  essential  to  capture 
this  expertise.  Thus,  some  other  form  of  automated 
assistance  in  operating  INCA  was  necessary.  The 
conceptual  framework  for  merging  statistical  decision 
theory  and  expert  system  technology  has  been  explicated 
In  [3]. 
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Figure  7.  Candidate  Feasible  Scene 
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Figure  8.  Scattergram  Facility 
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The  Track  Association  algorithm  combines  a  ship’s 
underwater  sonar  and  surface  radar  pictures  in  order 
to  improve  the  quality  of  the  underwater  picture . 

Introduction 

The  deployment  of  passive  arrays  has  potentially 
greatly  improved  underwater  detection  performance, 
by  making  it  possible  to  detect  targets  at  much 
greater  ranges  than  was  previously  possible.  A 
consequence  of  this  improved  capability  is  that 
there  tend  tc  be  many  more  targets  within  the 
array's  detection  range.  This  in  turn  complicates 
the  task  of  processing  observations,  in  order  to 
derive  an  underwater  picture  consisting  of  tracks 
and  their  associated  characteristics.  The  problem 
is  made  more  acute  by  the  fact  that  the  observations 
carry  limited  information,  namely  bearing  and 
frequency. 

The  formation  of  the  underwater  picture  from  array 
observations  consists  essentially  ofi 

-  assigning  observations  to  tracks,  that  is 
deciding  which  observations  come  from  the  same 
target,  and 

-  combining  the  information  contained  in  the 
observations  in  order  to  form  estimates  of 
the  tracks*  characteristics. 

The  sonar  operator  can  associate  observations  to  form 
tracks  provided  that  there  has  been  no  break  in 
contact.  However,  he  in  unable  to  recognise  when 
two  or  more  tracks,  which  existed  at  differ- -it  times,' 
should  be  associated.  This  could  arise,  for  example, 
after  a  manoeuvre  by  own  ship,  or  as  a  result  of 
sonar  intermlttency. 


The  Track  Association  algorithm  aiffl3  to  enhance  a 
ship's  underwater  picture  in  the  following  ways: 

-  by  recoouising  that  several  track  segments 
origimte  from  the  same  target,  and 

-  by  classifying  track  segments  as  ships,  as  a  result 
of  comparing  track  segments  with  the  surface  radar 
picture.  If  there  are  a  lot  of  ships  around  then 
this  is  potentially  a  powerful  way  of  improving 
the  qua.ity  of  the  underwater  picture,  since  the 
radar  picture  contains  much  more  precise  infor¬ 
mation. 


The  Algorithm 

The  algorithm  combines  a  ship's  underwater  sonar  and 
surface  radar  pictures  into  a  single  picture.  He 
shall  refer  to  tracks  in  the  comronent  pictures  as 
segments  and  reserve  the  terra  tracks  for  tracks  in  the 
combined  picture.  A  track  therefore  consists  of  one 
or  more  segments.  The  objective  of  the  algorithm  is 
to  place  segments  from  the  same  target  in  the  same 
track,  and  segments  from  different  targets  in 
different  tracks.  The  performance  of  the  algorithm 
is  the  degree  to  which  this  can  be  achieved. 

Parameter! sat ion  of  Segments  and  Tracks 

Segments  and  tracks  are  characterised  in  the  algorithm 
by  their  estimated  solutions  «,-d  covariance  matrices, 
which  are  pararaeterlsed  in  terms  of  Reciprocal  Polar 
co-ordinates.  These  are  defined  as  follows: 

0  *  the  bearing  of  chc  target 

02  -  the  inverse  of  the  target's  range 

03  «  the  target's  bearing  rate 

0^  ■  the  relative  range  rate  of  the  target 

05  *  the  frequency  that  would  be  received  by  a 

stationary  observer  at  the  origin  in  space  and 
time  (for  sonar  segments  only! . 

All  these  quantities  arc  measured  from  a  stationary 
origin  at  or  near  the  receiving  sensor  at  a  specific 
time . 


The  reason  for  using  this  parameterisation  is  that 
the  covariance  matrix  provides  more  realistic 
confidence  intervals  for  the  estimates  of  sonar 
aegments,  in  particular  their  estimated  range.  The 
parameterisation  therefore  provides  a  more  accurate 
estimate  of  the  penalty  incurred  (in  terms  of  good¬ 
ness  of  fit)  when  segments  are  combined  to  form  a 
track.  For  a  general  discussion  on  Reciprocal  Polar 
co-ordinates  see  (1). 


Generation  of  Component  pictures 

During  the  development  of  the  algorithm  we  have 
generated  the  component  underwater  and  surface 
pictures  in  the  following  way.  First  a  scenario 
generator  program 
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Initially,  we  utilized-  one  of  the  expert  system 
building  tools  emerging  from  AI  laboratories  to  "cap¬ 
ture"  the  analysts  actions.  The  0PS5  system  developed 
at  CaTegi  e-Mel  Ion  University,  was  chosen.  We  have 
been  utilizing  0?S5  in  this  development  activity  for 
approximately  18  months. 

Since  many  of  the  features  of  the  INCA  system  are 
in  continuous  development,  we  did  not  process  signifi¬ 
cant  amounts  of  real  data  and  the  development  of  a 
sophisticated  rule  set  was  impossible.  As  a  test  of 
0PS5,  we  addres-.ed  the  simpler  problem  of  creating  a 
syste.i’  controller  that  is  capable  of  running  the  INCA 
system,  i.e.,  executing  a  sequence  of  actions  and 
allocating  machine  resources.  Eventually,  this  process 
will  be  expanded  to  include  more  complex  decision 
tasks. 

Up  to  this  time,  we  have  successfully  built  and 
tested  a  control  system  which  uses  simple  resource 
allocation  procedures  (e.q.,  it  always  deletes  the 
oldest  contending  entity)  and  can  differentiate  simple 
situations  and  select  algorithms  for  feasible  track 
generation. 
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generates  t he  number  and  kt  rvnatics  of  targets 

determines  the  positions  of  targets  at  specified 
times 

derives  raw  observations  of  the  targets  consistent 
with  the  performance  of  ships'  passive  s;cnar  and 
radars.  (A  sonar  oosorvation  consists  of  a  bearing 
and  a  frequency,  while  a  radar  observation  cc-*  .  •ws 
of  a  bearing  and  a  range) .  , 

a  isiqr.s  the  observations  to  segments.  0bse*v4t‘  in 
from  the  same  target  are  assigned  to  *1.2  same 
segment  provided  that  they  are  made  by  the  same 
sensor  and  there  has  been  no  break  of  contact. 

The  estimated  solution  and  covariance  matrix  of  each 
segment  is  then  calculated  for  each  segment  using  ^ 
catch  filter.  This  program  minimises  the  sum  of 
squares  of  residuals  between  fitted  and  observed 
values.  Note  that  not  all  sonar  bearing  observations 
are  processed  since  they  cannot  be  assumed  independ¬ 
ent.  Instead  the  first  and  last  bearings  plus  bearings 
derived  whenever  targets  change  beam  are  processed. 
Note  also  that  sonar  segments  may  have  two  solutions 
resulting  from  the  bearing  ambiguity  of  sonar 
observations. 

Assur.pt  ions 

The  initial  implementation  of  the  algorithm  has  made 
a  number  of  assumptions,  as  follows: 

*  each  target  travels  with  constant  velocity, 
consequently  no  attempt  is  made  to  detect 
manoeuvres,  and 

-  each  target  emits  a  simple  steady  sonar  frequency 

The  assumptions  were  made  to  simplify  the  initial 
implementation,  and  will  be  relaxed  in  the  next 
phase  of  development.  It  was  considered  preferable 
to  solve  the  fundamental  problems  before  being 
concerned  with  complicating  factors.  The  scenario 
generator  is  consistent  with  the  above  assumptions. 

Objective  Function 

The  assignment  of  segments  to  tracks  is  determined 
by  minimising  an  objective  function.  The  objective 
function  is  defined  by 

C  *  S  ♦  «xT 

where  S  is  the  residual  sum  of  squares  between 
fitted  and  observed  values,  taken  over  all  tracks, 

T  is  the  number  of  tracks  and  o<  is  a  constant.  The 
value  of  <k  is  chosen  to  balance  two  conflicting 
factors:  the  desirability  of  segments  from  the  same 
target  being  associated  and  the  undesirability  of 
associating  segments  from  different  targets. 

Assignment  of  Segments  to  Tracks 

It  would  be  too  costly,  in  terms  of  computing  time, 
to  determine  the  absolute  minimum  of  the  objoctiva 
function  at  specific  times,  so  an  approach  which 
tries  to  improve  on  the  existing  assignment  of 
segments  to  tracks  is  used. 


The  assignment  of  segments  to  tracks  is  reviewed 
periodically.  At  each  review  time  a  number  of  sets  of 
segments  are  selected.  Each  set  forms  the  basis  of  a 
review,  and  consists  of  segments  which  are  assumed  to 
have  come  from  distinct  targets.  From  our  assumptions, 
the  sonar  or  radar  segments  current  at  a  given  time 
satisfy  this  property.  Sets  are  selected  so  that  they 
include  all  seaments  current  at  the  review  time,  all 
segments  which  started  after  the  previous  review  time 
(in  order  to  ensure  that  they  are  assigned  to  tracks 
at  the  earliest  opportunity) ,  and  some  (or  all) 
historical  segments. 

The  method  of  reviewing  the  assignment  of  the 
segments  in  a  sot  to  tracks  is  as  follows^ 

a.  The  characteristics  of  the  reduced  tracks,  formed 
by  removing  the  *  segments  being  reviewed  from  the 
current  tracks,  are  determined. 

b.  The  increase  in  the  objective  function  (or  cost) 
resulting  from  assigning  each  segment  to  each 
reduced  track  is  determined  (i.e.  the  increase  in 
residual  sum  of  squares),  as  is  the  cost  of  not 
assigning  a  segment  to  any  existing  reduced  track 
ii.e'.pi). 

c.  Having  worked  out  all  the  costs,  the  assignment  of 
segments  to  tracks  incurring  minimum  total  cost  is 
determined.  This  is  done  with  the  restriction  that 
at  most  one  segment  can  be  assigned  to  any  track 
(in  order  to  be  consistent  with  our  definition  of 
a  set,  of  segments)  . 

cl.  Finally  the  tracks  are  updated  by  assigning  the 
segments  to  tracks  in  the  manner  determined  in  c. 
above.  Segments  which  were  not  assigned  to  reduced 
tracks  form  single-segment  tracks. 

After  completing  the  ;et  reviews  at  each  review  time 
the  tracks  are  examined  to  see  whether  any  should  be 
merged.  Two  tracks  are  roeiged  if  this  results  in  a 
decrease  in  the  objective  function,  and  this  happens 
if  the  increase  in  the  residual  sum  of  sguares  caused 
by  the  merge  is  less  than  the  constant  pc. 

Figure  of  Merit 

We  have  defined  a  figure  of  merit  (FOM)  which  takes 
values  between  -1  and  +1 ,  such  that  M  is  achieved 
when  there  is  perfect  association,  and  that  the  FOM 
decreases  as  the  performance  of  the  algorithm 
deteriorates. 

Performance  or  Algorithm 

The  algorithm  has  been  tested  with  several  scenarios, 
each  generating  approximately  50  segments  over  a  12 
hour  period.  The  FOM  for  all  these  scenarios  is  +1  at 
every  review  time.  It  is  worth  noting  that  several 
segments  (approximately  10%)  consist  of  a  single 
sonar  observation  (bearing  and  frequency)  only. 

In  order  to  achieve  this  performance  the  method  of 
computing  the  cost  when  assigning  a  segment  to  a 
reduced  track,  and  the  method  of  calculating  the 
characteristics  of  a  track  from  its  component  segments 
have  undergone  considerable  refinement.  An  iterative 
process  is  nov  used  which  forms  a  trial  solution,  and 
then  re linearises  about  this  trial  solution  in  order 
to  form  an  improved  solution . 
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Enhancements 


Ouring  the  next  year  we  plan  to  relax  the  assumptions 
made  during  the  initial  implementation.  We 
anticipate  including  the  following  featuresi 

*  Multiple  sonar  frequencies  from  a  target 


Target  manoeuvres 

Pat*  from  other  platlorms. 


Acknowledgement 

The  development  of  the  Track  Association  algorithm 
has  been  supported  by  the  Admiralty  Surface  Weapons 
Establishment  (ASWE)  of  the  British  Ministry  of 
Defence.  1 


Reference 

11)  The  Use  of  Reciprocal  Polar  Co-ordinates  in 

Passive  tracking,  A.  C.  Bamford,  E.M.L.  Beala, 
J,  t*e,  S.  Patel,  Proceedings  of  6th  HIT/ONJt 
Workshop  on  Command  and  Control,  1983. 


190 


FURTHER  RESULTS  IN  MULTI  PLATFORM  CORRELATION  AND 
GRIDLOCK  IN  THE  NAVAL  BATTLEGRCUP 


M.A.  KOVACICH 


COMPTER  Research,  Inc.,  100  Corporate  Place,  Suite  B 
Vallejo,  California  94590 

rtf'/-  fc  ,i , /■  s 

J 

the  force  tracking  picture 
developed  by  aggregate  force  sensors  be  accurate,  nonredundant ,  and 
registered  appropriately  in  each  platform's  coordinate  frame.  This 
report  discusses  further  results  in  the  devel'-oment  of  systems,  de¬ 
noted  Force  Track  Alignment  systems,  that  satisfy  .hese  requirements. 

The  evaluation  methodology  is  discussed  and  performance  data  is 
presented  that  demonstrates  the  effectiveness  of  an  FTA  system  that  is 
composed  of  a  Kalman  bias  estimation  algorithm  coupled  to  a  sequen¬ 
tial,  track  history  dependent  correlation  process.  ^ - , _ 
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ABSTRACT .  Battlegroup  require*  that 


INTRODUCTION 


Hult ipl«t( orm  correlation  and  gridlock,  denoted  in  thia 
report  aa  Force  Track  Alignment  (FTA),  designates  the 
diatributed  functional  capability  of  the  Haval  battlegroup 
to  align  the  local  tracking  picture  of  each  platform  with 
the  cumulative  tracking  picture  developed  by  the  other  plat* 
fora*  in  the  fbree.  Alignment  requires  accurate,  bias  free 
coordinate  conversions  between  platforms  and  reliable 
interplatfor^  track  pairings  in  older  to  present  a  one  to 
one  representation  of  the  tactical  environment  to  each  plat* 
fora  in  the  coordinate  frame  of  each  platform.  Nhen  viewed 
as  part  of  the  overall  C3  system  of  the  battlegroup,  FTA  la 
a  surveillance  function  in  support  of  bsttlegroup 
operations.  In  current  fleet  operations,  unfortunately, 
this  support  is  limited  und  sometimes  counterproductive  due 
to  the  fragility  and  unreliability  of  the  gridlock  and  cor* 
relation  processes. 

The  Naval  community  has  a  number  of  ongoing  projects  to 
improve  the  FTA  process  in  order  to  improve  bsttlegroup 
coordination  and  to  realise  new  engagement  strategies,  par* 
ticularly  Intership  fire  control.  The  Gridlock  Demonstra¬ 
tion  System  (GDS),  developed  at  Johns  Hopkins.  Universi¬ 
ty/Applied  Physics  Laboratory,  ia  currently  undergoing 
At-Sea  testing.  The  FTA  project,  to  be  described  in  this 
report,  is  under  NAVSEA  sponsorship  and  is  currently  under¬ 
going  Landbased  testing.  In  the  near  future,  the  JTIDS  com¬ 
munication  link;  with  its  accurst*  relative  navigation 
(RELNAV)  capability,  and  NAVSTAR  GPS;  with  its  accurate  geo¬ 
detic  navigation  capability,  will  further  contribute  to  the 
capability  of  the  battlegroup  to  operate  as  s  unit  with  sur¬ 
veillance  assets  on  one  platform  directly  supporting  engage¬ 
ment  requirements  on  other  platforms. 

The  purpose  of  this  report  is  to  discuss  some  results  ob¬ 
tained  in  developing  FTA  algorithms  for  deployment  ir.  a 
pre-JTTDS  and  pre-NAVSTAR  GPS  environment.  The  report  is  in 
two  p*-ts.  The  first  part  places  FTA  in  the  contest  of  the 
Battlegroup  C3  system.  The  second  part  details  the  evalua¬ 
tion  methodology  and  the  performance  results  obtained  in 
developing  an  FTA  system. 


FTA  AS  PART  OF  BATTLEGROUP  C3 


For  the  purpose  of  this  discussion,  Battlegroup  C3  will  be 
viewed,  in  the  sense  of -Morgan  (1),  aa  a  nested,  distributed 
collection  of  Perception  -  Assessment  -  Decision  Taking  - 
Execution  (PADE)  control  processes,  wherein; 

Perception  is  the  process  of  sensing  the  envi¬ 

ronment  and  developing  a  represen¬ 
tation  of  the  state  of  the  environ¬ 
ment  , 

Aumatnt  is  the  process  of  relating  percep¬ 

tions  to  the  objective  of  the 
control  process. 


Decis.on  Taking  is  the  process  of  deciding  on  a 
course  of  action  based  on  the  as¬ 
sessment. 

Execution  is  the  process  of  carrying  out  the 

decision  and  interacting  with  the 
environment . 

Generally, f  each  PADE  control  process  operates  on  a  charac¬ 
teristic  Mae  scale  over  which  the  objective  is  constant  and 
ia  coordinated  with  other  PADE  control  processes  that  have 
overlapping  characteristic  time  scales,  objectives,  or 
resource; requirements. 

Within  this  view  of  Bsttlegroup  C3,  FTA  is  essentially  an 
element  of  the  Perception  process  and  directly  supports  the 
assessment  and  coordination  activities  of  the  battlegroup. 
rTA,  through  the  process  of  intership  track  correlation, 
augmentation  of  remote  data  with  local  data  on  co. related 
trackr ,  link  reporting  responsibility  logic,  and  intership 
bias  removal  strives  to  maintain  a  nonredundant  force  track 
file  that  contains  the  most  complete  and  accurate  track  data 
generated  by  force  sensors  and  registered  in  each  ships  co¬ 
ordinate  frame.  Force  coordination  is  supported  by  FTA 
since  each  platform  perceives  the  same  environment  by  means 
of  the  single,  established  force  track  tile. 

Improvements  in  FTA  will  lead  to  performance  improvements  in 
a  number  of  PADE  control  process.  To  choose  one,  consider 
the  timing  relationships  depicted  in  Figure  1  for  the  PADE 
control  process  that  leads  from  Initial  radar  detection  by  a 
surveillance  radar  to  lock-on  by  the  fire  control  radar. 
Suppose,  also,  that  the  initial  radar  detection  occurs  on 
the  remote  platform.  (This  situation  is  likely  to  occur  in 
future  battlegroup  operation  In  which  the  remote  platform 
carries  the  exceedingly  accurate  AN/SPY-1  radar  and  ownship 
carries  a  typical  surveillance  radar.) 


TIME 


LOCAL  TRACK  DATA  STA8L* 


REMOTE  TRACK  DATA  STABLE 
/Tl 

A  THREAT  ASSESSMENT  STABLE 


D 

E 


/  OECISION  TO  DESIGNATE  STABLE 

©/ . 


LOCK  ON  STABLE 


INITIAL  FIRE  CONTROL  FIRE  CONTROL 

DETECTION  LOCK  ON  LOCK  ON 

(GOOD  ALIGNMENT)  (POOR  ALIGNMENT) 


riGURE  1.  Tl.lnq  Diaqra.  for  PADE  Control 
Froceaa  leading  fro.  Radar  Detection  to  Fire 
Control  Redar  loch-On. 
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In  current  fleet  operations,  wherein  gridlock  and  correla¬ 
tion  are  not  always  relleole,  the  response  time  in  the  PADS 
control  loop,  from  initial  detection  to  fire  control 
lock-on,  is  slowed  due  to  delays  in  the  threat  assessment 
process  and  the  fire  control  search  sequence.  Threat  as¬ 
sessment  is  slowed  in  areas  of  overlapping  surveillance  cov¬ 
erage  since  operators  are  not  confident  of  correlation  deci¬ 
sions.  oo  the  two  radially  inbound  tracks  truely  represent 
two  vehicles  or  only  one  vehicle  doubly  cepr*»«nt*J  or  actu¬ 
ally  three  vehicles  because  of  a  ■isscortelation  decision? 
Once  enough  corroborating  evidence  ia  obtained  (e.g.,  over 
voice  link,  IPP,  visual  contact,  etc.)  to  Indicate  that  the 
remote  track  is  a  definite  threat  (i.e.  the  threat  asses?  • 
sent  is  declared  stable,  as  shown  in  the  figure),  then  a  re¬ 
sponse  is  conside-ed.  A  typical  response  is  to  lock-on  to 
the  threat  with  the  fire  control  radar  aa  a  prelude  to  actu¬ 
ally  launching  a  Missile.  Unfortunately,  because  in»ership 
bias  removal  has  historically  been  to  poor,  tactics  have 
evolved  that  require  the  fire  control  aearch  sequence  be  in¬ 
itiated  only  on  local  t  ack  data.  In  many  cases*  the  -•arch 
time  ia  iapractically  high,  thereby  tying  up  a  valuable 
resource  when  the  search  sequence  ia  initiated  on  remote 
data  that  suffers  fro*  large  gridlock  and  aiiaal ignrent 
errors.  Therefore,  the  fire  control  aearch  sequence  is 
delayed  until  local  track  data  becoaiea  available.  Once  a 
local  track  is  established  and  determined  to  be  the  same  aa 
the  remote  track,  then  the  search  sequence  proceeds  «nd  ul¬ 
timately,  in  moat  instances,  leads  to  a  stable  lock-on. 

As  the  accuracy  and  reliability  of  the  alignment  process 
Improve,  s  number  of  delays  in  this  PACE  control  process  can 
be  removedt 

®  the  time  required  for  threat  assessment  can  be  re¬ 
duced  since  the  correlation  decisions  are  more  re¬ 
liable  leading  to  accurate  threat  assessments 
without  aa  much  coi roborating  evidence; 

@  therefore,  the  decision  to  lock-on  can  be  made 
sooner,  and 

(3)  the  fire  control  aearch  algorithm  need  not  wait 
for  local  track  data  since  the  interahip  biases 
are  removed  Ly  the  alignment  process. 

Effectively,  when  the  alignment  process  is  accurate  and  re¬ 
liable,  each  platform  can  respond  to  the  tactical 
enviornment  aa  if  each  remote  track  is  as  accurate  and  reli¬ 
able  **  a  1«c«J  track.  *be"  th.'e  capability  i«  established 
in  the  battlegroup,  new  engagement  tactics  (i.e..  Decision 
Taking  and  Execution  processes)  become  possible, 
particularly  the  capability  to  launch  a  missile  on  pure]*' 
remote  track  data  ("Launch  on  Remote*  capability).  Theae 
engagement  tactics,  generally  denoted  as  intership  fire 
control,  will  be  particularly  effective  with  the  introduc¬ 
tion  of  the  AM/SPY-1  radar  which  is  capable  of  highly 
accurate  tracking.  ETA  will  be  a  critical  feature  in 
realising  the  full  potential  of  this  and  other  systems  in 
•attlegroup  C3. 


FORCE  TRACK  ALIGNMENT 


Turning  to  a  diacuaalon  of  I  * A  itself,  FTA  designates  the 
Interrelated  set  of  algorithms  that  automatically  and  con¬ 
tinuously  align  the  local  track  picture  and  the  force  track 
picture.  Aa  such,  PTA  correlates  local  and  remote  tracks 
received  from  all  platforms  on  the  data  link,  and  estimates 
and  coitects  biases  between  each  platform  and  the  Gridlock 
Reference  Unit  (GRU)  which  establishes  the  standsrd  coordi¬ 
nate  system  for  the  interchange  of  track  data.  The  flow  of 
data  in  an  FTA  system  is  shown  in  Figure  2  '  and  discuss'd 
nor?  fully  in  Rovaeich  (21. 

Some  underlying  assumption  made  in  the  design  of  PTA  Cor  de¬ 
ployment  In  the  1984-1990  time  frame  are  that  each  platform 
maintain  a  single  set  of  bias  estimates  with  the  GRU  vice 
bias  estimates  with  all  platforms,  and  that  FTA  employ  a 
single  correlation  algorithm  to  satisfy  the  dual  require¬ 
ments  of  generating  mutual  tracks  with  the  GRU  for  gridlock 
purposes  and  generating  ritual  tracks  with  remotes  from  all 
platforms  in  order  to  maintain  an  accurate,  nonredundant 
force  track  file.  The  technological  environment  is 
pre-JTIDS  and  NAVSTAR  GPS  and  consists  of  current  fleet 
radars  and  navigation  systems  and  the  current  dsta  link 
(Llnk-11 ) . 


EVALUATION  METHODOLOGY 


The  objective  of  the  project  is  to  find  a  »<x  of  algorithms 
and  algorithm  thresholds  such  that  PTA  system  performance  is 
maximized.  The  objective  ia  a  formidable  one  given  the  va¬ 
riety  of  algorithm  possibilities,  the  complex  feedback  in¬ 
teractions  in  FTA,  and  the  large  number  of  system  thresh¬ 
olds. 

To  manage  this  problem,  seven  candidate  PTA  systems  were 
designed  which  reflected  various  choices  from  s  spectrum  of 
possibilities  (e.g.,  single  pass  vice  multipass  cor relatLon> 
constant  g;ln  vice  Kalman  bias  estimation,  uncoupled  vice 
coupled  systems,  and  PTA  systems  with  and  without  oper.  loop 
initial ization  algorithms).  A  Monte  Carlo  simulation  pro¬ 
graft  was  written  in  order  to  stress  the  various  PTA  ayetems, 
tc  examine  performance  levels,  and  to  optimize  system 
thresholds.  The  approach  to  optimising  the  system  thresh¬ 
olds  was  to  find  the  set  of  thresholds  in  which  the  PTA 
system  exhibited  acceptable  correlation  and  bias  estimation 
performance  over  the  largest  range  of  track  densities  in  the 
presence  of  severe  irtership  biases  (10  nmi  gridlock  1  error, 
4  deg  azimuth  bias,  and  15  to  50  Knots  of  Intership  drift). 
Standards  for  acceptable  FTA  performance  were  defined  and 
specified  in  terms  of  allowed  levels  of  dual  designation 
(leas  than  51),  gridlock  accuracy  (less  than  .75  nmi);,  ini- 


P1GURE  2.  Flow  of  Track  Data  in  the  Force 
Track  Alignment  System.  Only  Closed  Loop 
Stage  is  shown.  Open  Loop  Stage  generates 
Initial  Bias  Estimates  and  corrects  Degraded 
Closed  Loop  Estimations. 
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i tat; ion  tiat  (last  than  5  minuses),  and  stability  across 
gatlon  updatas  (lass  than  5%  incorract  decorrelation*), 
outcome  of  this  phase  of  tha  avaluation  prograa  to  aaxi- 
•  aach  FTA  system  is  prasantad  in  Figure  1.  Tha  com  and 
iwter  load  requirement*  of  aach  system  are  plottad 
net  tha  maximum  track  density  for  which  tha  FTA  systaa 
afiad  tha  accaptabla  performance  standards.  Each  PTA 
and  all  performance  dat»  ara  datailad  in  (41. 
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FIGURE  5.  Alignment  Parforaanca  of  Loosaly 
Couplad  FTA  Systaa  that  aaploya  Multi-Pass 
Corralation  and  Kalman  Bias  Eatiaatlon  (FTA 
•191.2). 


datail  in  Millar  (31.  Figures  4  and  5  prascnt  tha  parfor¬ 
aanca  of  tha  bast  par  torn ing  FTA  systaa  (PTA6191.2)  and  the 
poorast  performing  systaa  (FTAO),  respectively.  Tha  figures 
plot  tha  scan  averaged  intertrack  separation  versus  scan 
period.  Note  tha  cosparativa  stability  of  FTAS191.2.  Tha 
FT AO  suffered  because  of  aisncorrelation*  that  drew  off  tha 
bias  estimate*  leading  to  further  sisacor relations  and  bias 
errors.  Tha  sisscorrclations  and  inability  to  later 
dacorralata  ►he  incorract  decorrelations  ears  due  mainly  to 
tha  lack  of  internal  coupling  between  tha  correlation  and 
biaa  estimation  procaas. 

As  s  result  of  tha  simulation  tasting  and  system  performance 
using  tha  live  radar  tape#  ^^8191.2*  was  recommended  for 
implementation.  Tha  next  section  provides  a  description  of 
this  system. 


DESCRIPTION  OF  RECOMMENDED  PTA 
SYSTEM 


MC  70  21  V  J 

i 

FIGURE  Maximum  Track  Density  for  which 
the  PTA  System  demonstrated  acceptable  per¬ 
formance  (X  10“*  tracks/NHI* ) 


Tha  algorithm  features  of  tha  recommended  FTA  systaa  will  be 
discussed  in  terss  of  Initialisation*  Correlation*  Biss  Es¬ 
timation  and  Internal  Couplit.9* 

Inltialixatlon 


f 

•  next  stage  in  the  evaluation  program  was  I  to  exercise 
ch  FTA  system  with  live  radar  data.  A  radar  data  tape  was 
ovided  by  Applied  Physics  Lab/Johns  Hopkins  University  and 
ntained  time  synchronous*  smooth  track  data  from  two 
/SPS-39  radars*  one  at  APL/JHU  and  the  other  at  the  Naval 
search  Lab  34  miles  sway.  Tha  data  tape  Is  described  in 


Tha  open  loop  Initialisation  algorithm  employs  a  two-track 
pattern  matching  algorithm  that  1m  used  to  provide  an  ini¬ 
tial  estimate  of  the  intership  biases  and  used  In  correcting 
bias  estimates  when  the  closed  loop  portion  of  .TA  degrades 
through  destructive  feedback  of  miaacorrelatlons  to  the  bias 
filter.  The  algorithm  searches  for  local-local  track  pairs 
whose  spatial  separation  equals,  within  track  noiss*  the 
separation  between  remote-remote  track  pairs.  (Intertrack 
spatial  separation  between  local  tracks  and  between  the  cor¬ 
responding  remote  tracks  la  translation  and  rotation 
invariant  ao  la  a  valuable  indicator  that  two  local-remote 
track  pairs  are  correlatlble. )  Additional  correlation 
checks  are  made  on  the  two  local-remote  pairs  (e.g.  identi¬ 
fication*  height  and  velocity  testa)  before  final! ring  the 
correlation.  The  set  of  correlations  determined  in  this 
manner  generate  the  initial  sat  of  Innovations  for  the  bias 
filter  or  the  needed  correction  for  the  degraded  bias  fil¬ 
ter. 


FIGURE  4.  Alignment  Performance  for  Uncoupl¬ 
ed  FTA  System  that  employs  Single  Paas  Corre¬ 
lation  and  Constant  Gain  Bias  Estimation  (FTA 
0). 


FIGURE  4.  Two-Track  Pattern  Matching. 
(LI * R1 )  and  (L2*R2)  are  correlated  o.ily  if  DL 
and  DR  agree  within  Track  Noiss. 
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Cor relation 

Tti*  correlation  algorithm  carries  out  a  aaquantial,  two  way 
(local- to- raaot*  and  ramota- to-local }  correlation  procaaa 
and  makaa  correlation  deciaiona  on  local- to-many  ramota 
aabiguitiaa,  ramota- to-many  local  ambiguities,  and  many 
local - fco- many  reaote  .mbigui tiea .  (Currently*  the  algorithm 
makaa  correlation  declalo ra  on  2x2  aatria  aabiguitiea  only. 
Halt  deciaiona  are  declared  on  higher  o.der  aatria 
aabiguitiea.)  The  correlation  factor  uaeJ  to  weight 
potential  pairings  and  used  in  decorrelation  processing  for 
established  pairings  (mutiala)  incorporates  track  positional 
histories  and  appears  in  Kovacich  (5).  The  correlation 
factor  is  recursively  calculated  and  follows  directly  froa 
Bayes  rulet 


CO(tl)  *  HCO**/Di.(ti),55'(ti)l 


(LHCQ(t)-l  )) 


(L-lHCOItj.,))*! 


WHERE  I 


L  *  P  IDLUjj.Duttij/NO  CORR.Ej'l  tj-1  )  ,Dg<  tt-i  11 


•  (*»,EXP|-DV(J<ra)l) 


The  advantage  of  thia  definition  of  the  innovation  sequence 
for  the  asiauth  filter  is  that  it  is  translation  invariant* 
thereby  eliminating  any  coupling  o'  the  translation  filter 
to  the  asiauth  filter.  Error*  c:  u«nsients  In  the  tranala- 
tion  filter  have  no  effect  on  the  asiauth  bias  eatiaate.  On 
the  ot\tr  hand,  the  translation  filter  is  coupled  to  the  as* 
iauth  filter  through  the  aeasureaent  covariance  Matrix, 
thereby  treating  the  asiauth  filter  as  a  noise  process. 

Coupling 

The  bias  filter  and  correlation  process  are  coupled  in  two 
ways.  First,  the  correlation  procejs  uses  the  biss  filter 
covariance  aatrices  to  slse  correlation  gates  and  to  update 
the  correlation  pcobsb* litlea.  Second,  the  bias  filter  is 
coupled  to  che  correlation  process  in  that  only  rutuala 
whose  correlation  probability  exceed  a  certain  threshold  are 
used  in  deteralning  the  innovations.  This  coupling  aspect 
has  proven  to  be  very  effective  in  generating  a  stable  force 
track  picture. 


SUMMARY 


This  report  discussed  further  results  obtained  in  the  design 
and  evaluation  of  PTA  systcre.  The  relationship  of  VTA  to 
Battlegroup  C*  and  the  evaluation  methodology  were  diacuseed 
and  features  of  the  racommanded  PTA  ayatem  were  presented. 
The  recommended  system  Is  currently  undergoing  Land-based 
testing  in  an  operational  Naval  Command  and  Control  program. 
At-Sea  testing  is  to  follow. 


D  •  SEPARATION  BETWEEN  DL( tt ) , Dr ( t4 ) 
tr2  -  SON  Of  LOCAL  AND  REMOTE  TRACE  VARIANCES 
R  -  SCALING  CONSTANT  REFERENCES 


DL<tl>»DR(ti)  -  LOCAL,  REMOTE  POSITION  VECTOHS 
(SMOOTHED)  AT  TIME  ti 


C0(O)  *  .5 


Dl.»  Kstlaatlon 

The  bln  filter  consist,  of  two  gaint-lialted  Kalaan  £11- 
ttcsi  1)  th.  trsnslstior  fiitsr  which  ssti.stss  t.is  lati¬ 
tude,  longitude,  lstitud.  velocity,  snd  longitude  velocity 
hiss*,  snd  2)  th.  asiauth  filter  which  eatiaatos  th. 
interchip  ssi.uth  biss.  Both  filters  perfor.  nsn.uv.r  de¬ 
tection  with  gains  reset  in  order  to  adept  ro  sudden  shifts 
in  intsrship  bisssa. 
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Correlation  and  Gridlock.  Proceedings  of  the  5th 
HIT/ONR  Norksnop  on  C3  Systeas.  LIDS-R-1267.  Deceaber 
19*2. 

(3 )  Hiller,  J.T.  GrlJlock  Data  Analysis  -  Initial  MRL/APL 
Data.  P2E-0-47B  ATL/JHU.  22  June  19B0. 

(4)  Force  Track  Alignaent  Systsa  Selection  Report.  COHFTER 

Research,  Inc,  Report  Ho.  10R1E-2400-1.  1  June  19*3. 

(51  Kovacich,  H.A.  An  Apostsrlor  Approach  to  the 
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ceedings  of  the  4th  H7T/ONR  Worksho!  on  Coec.no  and 
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The  innovation  sequence  for  thn  translation  filter  ie  the 
average  lat/long  separation  vector  between  the  Xocal/reaote 
tracks  asking  up  the  autuals  held  with  the  GRU.  The  innova¬ 
tion  aeguence  for  the  asiauth  filter  is  the  average  angular 
separation  between  line  aegaents  connecting  locei  tracks  and 
line  aegaents  connecting  reaote  tracks.  See  figure  7. 


FIGURE  7.  Definition  of  Innovation  for  Asi¬ 
auth  Filter. 
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Abstract 

J^~vnr  tactical  force  data  links  can  provide  accu¬ 
rate  relative  navigation  utilizing  signal  propagation 
delays,  transformed  into  corrections  to  dead-reckoned 
relative  position  estimates  by  a  decentralized 
extended  kalitan  filter.  The  accuracy  of  this  relative 
navigation  can  be  exploited  to  achieve  inleiship  sen¬ 
sor  alignment  among  the  participating  units.  In  cer¬ 
tain  common  geometric  situations,  a  simple  range-only 
triangulation  can  be  used.  The  results  obtained  by 
this  method  have  accuracy  comparable  to  more  sophisti¬ 
cated  filtering  techniques,  at  a  considerable  reduc¬ 
tion  in  computational  time. 

Introduction 


n  primary  requirement  for  effective  battle  force 
coordination  is  the  establishment  of  a  united  picture 
of  the  locations  and  actions  of  all  militarily  signif¬ 
icant  oojects  (tracks)  in  the  tactical  environment. 

In  modern  battle  groups,  with  numerous  units  reporting 
surveillance  information  derived  from  diverse  sensors, 
this  requirement  is  frequently  not  met.  Principal 
impediments  to  establishing  a  coordinated  track  pic¬ 
ture  arc  the  inability  of  reporting  units  to  locate 
themselves  precisely  in  a  consensual  local  navigation 
grid  (gridlock);  low  data  rates  and  unreliability  of 
the  communications  links;  misaligruent  among  sensors 
on  different  platforms;  and  the  inherent  innaccuracies 
of  the  sensors  themselves,  including  operator  .'-d 
tracker  errors. 


The  use  of  frequency-agile,  hirh  data-rate  UHF 
links,  su>  h  as  JTIDS,  affords  an  opportunity  to  reduce 
the  effects  of  poor  gridlock  and  of  slow,  unreliable 
communications.  Furthermore,  the  techniques  involved 
are  independent  of  the  sensor  errors  experienced  by 
participants.  As  a  result,  mathematical  techniques 
can  be  used  to  correct  for  sensor  misalignments, 
reducing  sensor  biases  to  the  point  of  insignificance 
when  compared  to  the  random  errors  inherent  in  *he 
sensors  themselves. 


The  t'rst  section  below  outlines  the  method  by 
which  the  Gridlock  Problem  is  solved  by  UHF  Relative 
Navigation.  The  second  section  proposes  a  highly 
accurate  technique  for  correcting  sensor  misaligrenents 
among  platforms.  The  third  section  contains  an 
analysis  of  the  remaining  error  in  sensor  alignment, 
due  to  all  factors  involved,  including  remaining 
gridlock  errors  and  sensor  system  inaccuracy. 

Relative  Navigation 

The  tactical  force  data  links  using  UHF  signals 
can  measure  signal  propagation  delays  for  calculating 
ranges  accurately  enough  to  maintain  good  relative 
navigation  (Rel  Nav).  An  example  of  such  a  system  is 
the  Joint  Tactical  Information  Distribution  System 
(JTIDS). 


A  UHF  link  Rel  Nav  process  employs  two  sets  ol 
data.  The  primary  data  are  the  outputs  of  on-board 
navigation  sensors  such  as  inertial  or  heading/speed 
systems,  together  with  precise  local  clocks.  The 
second  set  of  data  are  "pseudoraages ,"  the  propagation 
delays  for  Precise  Participant  Location  Information 
(PPLI)  messages,  as  measured  by  the  recipients.  These 
data,  which  include  clock  biases,  arc  used  t o  maintain 
a  model  of  errors  in  the  primary  data.  These  errors 
are  calculated  in  JTIDS  by  an  18-state  Kalman  filter 
maintained  by  each  participant.  Since  each  PPLI 
sturce  reports  his  own  position  and  time  variances, 
these  filters  can  be  adaptive,  and  feedback  is  reduced 
by  a  convention  that  prohibits  the  use  of  PPLI  mes¬ 
sages  with  reported  variances  greater  than  the  recipi¬ 
ent's  variances.  To  allow  for  both  absolute  and  rela¬ 
tive  navigation  to  be  passed  on  the  link,  an  arbi¬ 
trarily  designated  Relative  (U,  V)  Grid  is  maintained 
by  the  participants.  The  geodetic  positioning  and 
orientation  of  this  grid  is  updated  by  all  units  when¬ 
ever  any  participant  obtains  a  geodetic  fix  of  his  own 
position.  The  increased  accuracy  of  his  geodetic 
position  report,  as  indicated  in  his  PPL!  messages, 
allows  other  participants  to  improve  their  estimates 
of  their  own  geodetic  position,  either  by  direct  prop¬ 
agation  time-delay  measurement  or  by  relating  the  new 
geodetic  information  to  the  known  position  of  the  PPLI 
source  relative  to  the  recipient.  This  latter  method 
allows  the  use  of  relayed  (not  line-of-sight)  PPLI 
messages  for  improving  absolute  position  estimates. 
Clock  synchronization  is  enhanced  by  Round-Trip  Timing 
(RTT)  message  exchanges  (see  Figures  1  and  2). 


*  •  _  . 
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Figure  2.  Round-Trip  Tiding 


A  critical  factor  in  UHF  link  Rel  Nav  ia  the 
exclusion  of  data  from  sensors  auch  aa  radar.  Thia 
exclusion  allova  the  aeparate  computation  of  sensor 
biaa  by  each  tactical  force  unit,  referenced  to  the 
common  Rel  Nav  grid.  The  high  levels  of  accuracy  in 
Rel  Nav,  combined  with  elimination  of  large  aenaor 
azimuth  biases,  allow  for  much  improved  correlation/ 
decorrelation  criteria  and  a  unified  force  track 
picture.  Because  of  the  independence  of  different 
units'  calculations,  no  additional  message  traffic  ia 
required  on  the  link. 

? r.tersl i p  Senjor  Alignment 

A  method  for  determining  sensor  azimuth  biaa  ia 
aa  follows:  If  UQ  hold*  another  link  participant,  Ujt 
on  the  aenaor  to  be  aligned,  then  the  one-paaa 
reaidual  bias  estimate  using  Uj  is  simply 

49  -  tan*1  ( (xqVj  -  xjy„)/(yjy0  ♦  XjXQ))  (1) 

where  (xq,  yQ)  ia  the  position  of  Uj  acotding  to  Uq 
sensors,  and  (xj,  yj )  is  the  position  of  Uj  according 
to  his  own  report,  expressed  in  an  East-North  (or  U-V) 
aligned  coordinate  frame  centered  at  Uq  (see  Figure  3). 


If  Uq  does  not  hold  another  participant  on  the 
se> sor  to  be  aligned,  Uq  can  resort  to  a  second  method 
of  estimating  sensor  bias,  utilizing  a  mutual  track 
held  by  Uq  end  another  link  participant  Uj,  as 
follows : 

Let  T  be  the  object  heing  tracked  by  both  Uq  and 
Uj,  and  reported  by  Uj  over  the  link.  Let  Rq,  Rj 
be  the  distances  from  Uq  to  T  and  from  Uj  to  T, 
respectively.  Let  D  be  the  distance  between  Uq 
and  Uj  (see  Figure  A). 

Construct  a  rectilinear  x',  y'  grid  at  Uq  with 
positive  x'-axis  through  Uj.  Then  the  position 
of  T  is  calculated  aa: 

x'  -  (Rq  -  Rj  ♦  D2)  /  2D  (2a) 

y'  *  iV*o  *  <*'>2  (2b) 

The  choice  of  sign  for  y*  is  determined  by  the 
sign  of  (,  the  angle  formed  by  the  positive 
x'axis  and  the  ray  from  Uq  to  T. 


Figure  4.  Utilizing  Mutual  Track 


Figure  3.  Using  Rel  Nav  Position  of 
Participating  Unit 


First,  consider  the  general  error  model  for  any 
one-pass  azimuth  bias  estimate.  The  error  eg  is,  in 
general,  a  function  of  five  random  variables: 


Xq  ■  noise  in  measuring  Uq  azimuth  (radians) 
X]  ”  noise  in  relative  position  estimates 
by  Uq,  Uj  (feet) 

X2  ■  noise  and  bias  in  Uj  azimuth  (radians) 

Xq  -  noise  and  bias  in  Uq  range  (feet) 

X4  *  noise  and  biaa  in  Uj  range  (feet). 


These  five  random  variables  are  assumed  to  be 
independent.  Xq  and  Xj  have  mean  0. 


For  each  residual  bias  estimate  using  equation 
(1),  eg  can  be  approximated  linearly  by 
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«9  *  X0  ♦  Xj  /  D 


(3) 


If  equation*  (2a  and  2b)  are  employed,  let  T  be 
tne  angle  between  bearing*  to  T  from  Uq  and  Uj.  Then 

ce  •  Xq  ♦  Xj  /  D  ♦  X3  cot  ?  /  R0 

♦  X4  cac  T  /  Rq  +  S(X0)  (4) 

where  S(Xq)  i*  the  error  in  choice  of  aign  for  y‘. 

From  equation*  (3)  and  (4),  it  follows  that  if 
equations  (1)  or  (2*  and  2b)  .re  used  to  calculate 
residual  bias,  then: 

1.  eg  is  independent  of  X;; 

2.  If  equation  (1)  is  used,  the  estimate  is  the 
optional  measurement,  since  any  measurement 
includes  errors  Xq  and  X,/0; 

3.  The  coefficients  for  X3  and  X4  are  bounded  by 
1/Rq  Sin  T. 

For  many  sensors  (e.g.,  radars),  range  errors 
are  small  compared  to  the  ranges  at  which 
they  operate.  In  these  cases,  this  bound 
implies  that,  with  auitable  geometry  con¬ 
straints,  the  X3  and  X^  terms  of  (4)  are  no 
greater  than  Xq  and  frequently  are  much  less. 

4.  The  error  S(Xq),  resulting  from  the  choice  of 
the  incorrect  aign  in  Equation  2b,  can  be 
handled  by  observing  that  S(Xq)  «  0  or  S(Xp) 

“  -2y'/Rg,  depending  on  the  choice  of  sign 
for  y'.  Therefore,  for  any  fixed  value 

yg  >  0,  one  can  require  that  |y'|  >  yg  in 
Equation  2b;  otherwise,  the  track  T  will  be 
discarded  from  further  processing.  If  S(Xq) 

*  0,  then  |S(X0)|  >  2|y'|/K0  >  2>q/R0. 
therefore,  if  one  rejects  any  residual  bias 
estimate,  46.  greater  than,  say,  yg/Rg,  then 
for  any  accepted  48,  the  probability  of  S(Xq) 
i  0  is  less  than  the  probability  that  the 
total  sensor  asimuth  error  (Xq  <  biar) 
exceed*  yg/Rg.  Thur,  the  proba'  ility  of  S(X) 
^  0  can  be  made  arbitrarily  small  by  choosing 
yg  to  be  sufficiently  large;  the  choice 
depends  upon  the  distribution  of  the  total 
errors  experienced  by  a.-sors  of  the  tvpc 
being  aligned. 

The  proposed  bias  estimates  are  thus  expected  to 
give  errors  on  the  order  of  Xq  ♦  Xj/D  under  minimal 
geometric  constraints.  If  the  variance  of  this 
expression  is  given  by  o2,  then  the  least-squares 
approximation  using  n  observation*  will  have  an  error 
variance  of  approximately  o2/n.  When  implemented  in 
varied  simulated  environments,  the  methods  described 
above  exhibit  errors  within  these  expected  values, 
with  considerably  lower  processing  than  required  by  a 
least-square*  filter  which  models  the  (x,  y)  position 
Of  Uq,  U.,  and  T  for  each  track. 
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The  use  of  Reciprocal  Polar  co-ordinates  to  para¬ 
meter  ise  tracks  is  discussed.  Their  use  makes  the 
problem  of  determining  the  characteristics  of  tracks 
from  bearings  only  data  much  less  non-linear  than 
the  use  of  the  more  conventional  Cartesian 
co-crdinates.  ^ 

Introduction 


The  task  of  tracking  a  moving  target  from  sensor 
data  can  be  formulated  by  assuming  that  the  target 
moves  with  constant  velocity.  Once  this  basic 
problem  has  been  solved,  we  may  extend  it  by  assum¬ 
ing  that  the  target  manoeuvres  from  time  to  time, 
th  it  is  to  say  it  changes  its  speed  or  course.  For 
t.  2  basic  problem  it  is  natural  to  represent  the 
pcsition  of  the  target  in  terms  cf  parameters 
^3  and  such  that  the  Cartesian  co¬ 
ordinates  of  the  target  at  time  t  ares 

<Xe  ♦  +  5,(t  -  to>  ,  Y„  ♦  ♦  £,(t  -  t*,)) 

where  (Xe,  Y0)  denote  some  convenient  origin,  and  tc 
is  some  convenient  time.  We  can  then  compute  fitted 
values  for  the  observations  of  bearing,  range  or 
rote  of  change  of  range,  and  hence  find  parameter 
values  to  minimise  some  weighted  «•  m  of  the  squares 
of  the  discrepancies  between  the  measured  and  fitted 
observations. 

However,  if  most  of  the  data  consist  of  boarxnas 
from  a  single  slowly  moving  platform,  then  the 
model  is  highly  non-lincar  in  the  sense  that,  over 
the  whole  range  of  plausible  target  tracks,  the 
fitted  bearings  are  highly  non-linear  functions  of 
the  parameters.  This  nonlinearity  can  largely  be 
removed  by  using  a  .dif  ferent  paramotcrisation.  We 
shall  refer  to  the  parameterisat ion  we  chose  to 
reduce  the  nonlinearity,  as  Reciprocal  Polar  co¬ 
ord  i nafec , 

In  the  course  of  this  paper,  we  outline  our  reason" 
for  choosing  this  paramoterisation  in  the  first 
place,  and  then  proceed  to  describe  the  work  we 
carried  out  to  confirm  our  ideas.  In  the  main  this 
consisted  of  calculating  tie  non-lineaiities  for  a 
particular  scenario  with  respect  to  various  para- 
meterisations . 


1936  Formula 

Beule  11 J  suggested  and  Bate*?  and  Watts  (2J  confirmed 
that  Many  nonlinear  regression  problems,  of  which 
the  Bearings  Only  problem  discussed  here  is  an 
example,  can  be  made  much  less  nonlinear  by  making 
a  suitable  nonlinear  transformation  of  parameters. 
Tenney  et  a l  J3J  discuss  a  related  problem,  and 
propose  a  transformation  that  greatly  improves  on 
Cartesian  co-ordinates. 
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An  appropriate  transformation  for  the  Bearings  Only 
problem  is  suggested  by  considering  the  standard 
1936  Formula  for  estimating  a  target's  range.  If  the 
observed  bearing  rate  is  B(  when  own  ship  moves  with 
a  velocity  U»  across  the  bearing  and  is  B-*  when  own 
ship  moves  with  a  velocity  Ua  across  the  bearing, 
then  if  the  target  is  at  range  R  and  moves  with  a 
velocity  component  V  at  right  angles  to  the  bearing, 
we  have 

Bi  =  v~ui  and  Bl  *  V-IU 
R  R 

or,  in  other  words 

i  =  B»  ~Ba  and  V  *  U^B,  -  U,B& 

R  Ua-U,  R  Ua  -  U# 

This  means  that  the  reciprocal  of  the  range  and  the 
bearing  rate  from  a  stationary  point  near  own  ship 
are  linear  functions  of  the  data  B(  and  B**. 

Proposed  Parameterlsation 


The  above  analysis  suggests  that  the  parameters 
,  %2 >  ^3  and  should  be  replaced  by  parameters 
and  representing  the  bearing  and  the 
reciprocal  of  the  range  of  the  target  ti^e-  t0 
from  a  point  at  c*  r  owr>  ship's  position  at  this 
time,  the  rate,  and  parameter  related 

Aan ne  rate.  Our  first  thought  was  that 
should  b'_-  the  idle  of  chany«-  of  the  reciprocal  of 
the  range.  But  further  work,  described  lat*>r, 
suggests  that  ^  should  be  relative  range  rate,  that 
is  the  velocity  of  the  target  along  the  bearing  line 
divided  by  the  range.  It  is  natural  to  call  these 
Reciprocal  Polar  co-ordinates.  To  use  them  we  need 
formulae  for  the  as  functions  of  the  pj  and  for 
elements  of  the  Jacobian  matrix  (  ) .  Gi ven 

formulae  for  linear  approximations  to  the  fitted 
values  of  any  observation  in  terms  of  the  5, ,  we 
can  then  derive  corresponding  linear  approximations 
in  terms  of  the  pfj. 

Batch  Filter 


We  ha,re  implement  ;d  a  Mbatch  filter'5,  based  on 
Reciprocal  Polar  co-ordinates.  It  is  an  iterative 
non-linear  least  squares  procedure  for  estimating 
target  tracks  from  passive  bearing  data,  and 
calculates  the  suns  af  squares  of  residuals  in  terms 
of  Reciprocal  Polar  co-ordinates.  The  program 
currently  only  considers  nen-manoeuvring  targets. 

The  batch  filter  is  ut ed  extensively  by  the  Track 
Association  Algorithm  [4j. 
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Sp«  :ial  precautions  taken  in  the  implementation  of 
the  batch  filter  are  as  follows. 

(U  There  is  a  need  to  impose  both  a  lower  and  an 

upper  bound  on  px  (the  reciprocal  of  the  target's 
range) .  The  lower  bound  is  to  ensure  that  the 
ta  get's  range  remains  realistic  while  the 
upper  bound  avoids  the  singularity  at  the 
origin.  These  bounds  can  be  imtvased  by 
pivoting  last  on  this  parameter  and  modifying 
the  data  before  the  pivot  so  that  it  will  take 
the  desired  boundary  value.  The  method  of 
pivoting  is  described  by  Stiefel  15]. 

(2)  It  is  necessary  to  have  a  safeguard  against  non 
monotonic  behaviour  in  the  residual  sum  of 
squares,  and  can  be  achieved  using  a  simple 
bisection  formula. 

Benefits 


Reciprocal  Polar  co-ordinates  are  superficially 
more  complicated  than  their  Cartesian  counterparts, 
but  we  suggest  they  offer  the  following  advantages. 

(1)  The  task  of  computing  true  least-squares 
estimates  of  the  target  parameters  is  simpli¬ 
fied,  since  we  need  fewer  iterations  and  can 
dispense  with  the  usual  Levenberg-Marquardt 
modification  to  the  Gauss-Newton  method.  In 
general  only  3  or  4  iterations  are  required 
to  get  close  to  the  least  squares  solution. 

(This  solution  's  not,  of  cou—e,  dependent 
on  the  parameterisation  used) . 

(2)  The  parameterisation  allows  the  computation 
of  realistic  confidence  Intervals  for  the 
target  parameters,  and  in  particular  che 
current  target  range.  This  confidence  interval 
may  be  unsymmetrical ,  since  we  will  often  know 
that  the  target  cannot  be  very  close  but  may 
be  much  further  away  than  our  best  estimate. 

This  information  is  particularly  relevant  when 
one  has  to  decide  whether  ?  tracks  can  reason¬ 
ably  be  attributed  to  the  same  target. 

(3)  finally. a  nearly  linear  paramo?  -t-isation  ullcws 
some  use  of  sequential  (that  is  non  iterative) 
estimation  procedures,  and  in  particular  the  use 
of  manoeuvre  detectors  based  on  cumulative  sums 
of  the  Innovations  recommended  by  Brown  et  al 
(6]  and  Patel  (7], 


Wa  calculated  and  Ng  for  a  givi l  scenario, 
firstly  for  Cartesian  co-ordinates,  and  secondly 
for  the  family  of  co-ordinate  systems  satisfying 


-  B 

-  R**  fi 

where  B  and  R  are  the  target's  bearing  and  range, 
and  ar,  and  otj  are  constants.  Note  that  we  have 
Reciprocal  Polar  co-ordinates  when  «*,  and  «<a  *  -1. 


The  scer.rio  i3  illustrated  in  Figure  1. 


CAtcviArtos  cr  iisaiutMmis  -  scenario 


Figure  ( 


We  have  verified  these  theoretical  advantages  to  the 
extent  of  having  developed  the  batch  filter,  which 
we  outlined  earlier,  and  a  program  to  compute  the 
non-linearity  for  various  parameterisations  of  a 
typical  bearings  only  scenario.  In  tne  next  section 
we  discuss  this  computation  in  detail. 


The  ship  travels  on  a  dog-leg  course  at  10  knots 
making  a  bearing  observation  every  10  minutes. 
Observation  errors  are  noimally  distributed  with  a 
standard  de'  iation  of  2°.  Altogether  the  ship  makes 
V  observations.  The  target  submarine  is  initially 
50nm  from  the  ship  a»:d  is  travelling  at  a  constant 
20  knots. 


Calculation  of  Non-linearity 

The  possibility  that  a  non-linear  transformation 
of  parameters  can  make  a  non-linear  regression  model 
much  less  non-linear  was  mentioned  earlier.  This 
suggests  that  non-linearity  can  be  split  into  two 
typess  Beale  (1]  called  these  Intrinsic  Non¬ 
linearity  and  Parameter  Effects  Non-linearity. 

Beale  defined  formulae  for  non-linearity  such  that. 

*1*  (the  Intrinsic  non-linearity)  is  the  minimum 
measurement  of  ron-linearity  taken  over  all  para¬ 
meterisations  of  the  model,  while  Hg  (the  parameter 
effects  non-linearity)  is  the  total  non-linearity 
for  z  given  parameterisation  minus  Nf, 


The  intrinsic  non-linearity  for  this  scenario  is 
0.00191,  while  the  removable  non-linearity  for 
Cartesian  co-ordinates  is  0.547.  Figures  2,3  and  4 
show  the  value  of  the  removable  linearity  for  the 
family  of  co-ordinate  systems  for  various  values  of 
ar,  and  These  figures  indicate  that  the  para- 
meterisation  with  n,  =  "1  (i.e.  Reciprocal  Polars) 
gives  a  minimum  of  the  removable  non-linearity  for 
the  scenario.  Furthermore  the  total  non-linearity 
using  Reciprocal  Polar  co-ordinates  is  more  than  a 
factor  of  12  less  than  the  corresponding  figure  for 
Cartesian  co-ordinates.  These  results  justify  our 
original  reasons  for  defining  Reciprocal  Polar  co¬ 
ordinates. 
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Further  Work 

The  work  described  in  this  paper  has  investigated 
the  non-linearity  of  various  paraaeterisations  when 
the  observed  data  consist  of  bearings  only.  However 
the  bearing  observations  are  often  made  in  con¬ 
junction  with  frequencies  which  can  provide  infor¬ 
mation  on  a  target's  closing  speed. 

We  intend  in  the  near  future  to  calculate  the  non- 
linearities  of  various  paraaeterisations  when 
frequency  information  is  available.  Such  para- 
meterisations  do  of  course  require  a  fifth  frequency- 
related  ^orameter.  It  is  perhaps  worth  noting  that 
our  batch  filter  based  on  Reciprocal  Polar  co¬ 
ordinates  has  no  problems  converging  when  processing 
data  including  frequencies. 

We  shall  also  be  extending  the  batch  filter  so  that 
it  can  track  manoeuvring  targets  by  making  use  of 
innovations. 


Figure  3 
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Summary 


We  have  proposed  the  use  of  Reciproo  il  Polar 
co-ordinates  as  a  means  of  reducing  the  non¬ 
linearity  of  the  Bearings  Only  tracking  problem. 

The  co-ordinate  system  is  suggested  from  considering 
the  1936  formula,  and  was  justified  more  formally 
by  calculating  the  intrinsic  and  parameter  effects 
measurer  of  non-linearity  for  a  given  scenario 
using  various  parameterisations.  Practically,  we 
have  implemented  a  batch  filter,  based  on 
Reciprocal  Polars,  which  minimises  the  sums  of 
squares  o *  residuals.  The  filter  works  well  and 
converges  quickly. 
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Abstract 


The  problem  of  selecting  a  maneuver  policy  fcr 
single-platform  bearincs  only  search  and  localizat.’ on 
in  the  ocean  is  viewed  as  a  stochastic  optimal  control 
problem.  A  solution  is  developed  based  on  a  realistic 
environment  model  and  extended  Kalman  filter  tracker. 
Simulation  results  highlight  the  dominance  of  engage¬ 
ment  geometry  {e.g.  direct-path  vs.  convergence  tone) 
over  maneuver  policy  with  respect  to  total  performance 


I.  Introduction 


A 

I 


1. 1  The  Bearings-Only  Problem 


Due  to  the  complex  nature  of  sound  propagation  in 
the  sea,  a  passive  acoustic  search  involving  ocean 
vehicles  inherently  provides  a  minimum  of  information 
of  a  very  noisy  nature  to  the  searcher.  Currently  the 
research  emphasis  seems  to  be  toward  developing  methods 
to  utilize  the  acoustic  information  that  is  available 
in  a  more  efficient  manner.  For  instance,  acoustic 
doppler  information  can  often  be  gleened  from  the 
signal  spectrum  of  a  target,  providing  range  rate 
information  on  the  contact.  While  this  knowledge  of 
range  rate  can  be  a  tremendous  aid  in  specifying  the 
target's  state  (range,  bearina,  course  and  speed) , it  is 
opera ticnaly  easy  for  an  adversary  to  introduce  suffi¬ 
cient  randomness  in  the  frequency  spectrum  emitted  by 
his  vessel  to  make  his  acoustic  signal  unreliable  for 
this  purpose.  Another  are a  of  interest  has  been  to 
increase  the  acous-ic  aperture  of  the  hydrophone 
arrays  on  board  present  searching  units.  By  measuring 
the  time  differences  between  the  acoustic  signals 
incident  to  each  hydrophone  combined  with  the  array's 
bearing  resolution,  passive  target  localization  is 
possible.  However,  with  current  capcilities,  such  an 
increased  acoustic  aperture  is  too  large  for  feasible 
installation  on  searching  vessels.  Thus,  the  acoustic 
information  available  on  each  source  is  ofLofi  limited 
to  frequency,  sound  pressure  level  (SPL)  and  bearing. 
Frequency  information  can  be  extremely  valuable  for 
use  in  target  classification,  but  for  tracking  and 
especially  detection  func.ions,  it  has  many  limitations 
Due  to  the  propagation  vagaries  of  sound  in  the  sea, 

SPL  information  may  also  be  quite  misleading.  Conse¬ 
quently,  bearing  information,  although  potentially 
noisy,  provides  the  best  single  means  of  target  de¬ 
tection  and  tracking  with  current  passive  acoustic 
sensors  in  the  ocean  environment. 

1 . 2  Approach 

The  approach  to  the  passive  acoustic  search 
problem  followed  in  this  paper  consists  of  three  major 
parts.  First,  a  realistic  model  of  the  ocean  environ¬ 
ment  including  the  searcher  and  target  vehicles  Will 
be  develooed  in  order  to  simulate  the  stochastic  and 
dynamic  behavior  in  the  "real  world".  The  model  will 
include  the  behavior  inherent  in  its  four  ..iajor  compo¬ 
nents:  the  target,  the  medium,  the  searching  vehicle 
and  the  searcher's  sonar  equipment.  Second,  an 
estimator  will  be  developed  which  will  process  inform¬ 
ation  provided  by  the  model  as  an  aid  in  making 
various  search  decisions.  This  estimator  will  be 


exercised  principally  in  a  localizing  search  of  a 
given  area  to  provide  the  best  estimate  possible  of  a 
target's  course,  speed  and  range  at  any  instant  in  time. 
Finally,  based  on  all  of  the  information  provided  from 
both  the  model  and  the  estimator,  a  strategy  will 
contain  all  of  the  necessary  control  logic  to  conduct 
the  search. 

With  these  three  major  components  connected  properly 
a. series  of  strategies  which  seem  reasonable  can  and 
will  be  evaluated  in  an  open- loop  manner  by  means  of 
computer  simulation.  By  testing  these  strategies 
representing  various  scenarios  of  interest,  the  advan¬ 
tages  and  disadvantages  of  each  can  be  determined  for 
different  types  of  vehicle  encounters.  Subsequently, 
the  desirable  portions  of  each  strategy  can  be  imple¬ 
mented  with  the  required  decision  logic  to  conduct 
closed-loop  simulations  for  final  evaluation. 

H.  The  Model 

The  ocean  environment  is  an  extremely  inhomogeneous 
and  unpredictable  medium  in  which  to  conduct  a  search. 
The  acoustic  interactions  between  the  propulsion 
machinery  and  sonar  equipment  on  bv  ard  target  and 
seach  vehicles  in  these  surroir  dings  make  the  passive 
search  and  tracking  problems  extremely  difficult  to 
describe  accurately  yet  succinctly.  Although  more 
difficult  than  many  systems,  the  ocean  vehicle  search 
problem  can  be  approached  in  a  conceptual  way  as  a 
complicated  control  problem.  As  in  the  control  of  any 
system,  it  is  necessary  to  establish  a  model  which  is  a 
trade**  off  between  two  conflicting  desires.  The  first 
and  most  obvious  desire  is  to  establish  a  model  which 
is  detailed  enough  to  adequately  represent  the  actual 
system  to  be  controlled.  Secondly,  an  equally 
necessary  character! s:ic  of  a  useful  model  is  i.hat  it 
is  simple  enough  to  "solve"  for  a  practical  control 
strategy.  As  the  complexity  of  a  system  increases, 
finding  an  acceptable  model  which  satisfies  both  of 
these  desires  becomes  more  and  more  difficult. 

2.1  Environn  nt 

An  examination  of  the  passive  sonar  equation  can 
provide  a  model  which  reasonably  approximates  most 
interesting  areas  of  the  ocean.  In  general,  the  most 
important  components  of  the  transmission  loss  will  bo 
spreading, ( spherical  and  cylindrical) , absorption, 
leakage  and  convergence  zone  effects.  A  reasonable 
approximation  for  most  areas  is  the  transmission  loss 
expression  for  the  mixed-layer  case  corrected  for 
convergence  gain  effects  at  the  appropriate  convergence 
ranges. 

The  average  values  of  the  various  sonar  p.  am* tors 
in  the  sonar  equation  can  be  used  to  compute  a  j  reliabi¬ 
lity  of  detection  function  to  be  used  in  the  actual 
model  implementation.  This  conversion  from  average 
values  to  some  type  of  probability  distribution  or 
statistic  is  necessary  to  model  the  intermittent 
directions  made  at  the  sonar  system  output  for  time- 
varying  SNR's  near  zero.  From  Knight's  U)  de¬ 
scription  of  current  digital  signal  processing  techno¬ 
logy  used  in  sonar  systems,  a  graph  of  probability  of 
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detection  versus  SNR  can  be  derived  (see  Section  IV) . 

2 . 2  Vehicle  Dynamics 

In  this  model ,  target  courses  and  speeds  can  be 
modeled  in  a  variety  of  ways.  While  in  this  application 
they  were  deterministic  functions  of  time#  t^oy  could 
be  determined  by  any  other  desired  function  of  de- 
terrtir  istic  or  random  variable.  In  the  present  case, 
the  dynamics  of  each  vehicle  were  computed  by 

X.i+l)  *  X (i)  +  L  *VS*StN(VC> 

Y (i+l)  »  Y (i)  ♦  DT*VS  *COS(Vy) 

where  X  and  Y  *  the  cartesian  coordinates  of  the 
vehicle 

DT  *  the  time  difference  b«_ tween  the 
i  and  i+1  observations 

VS  11  vehicle  speed 

VC  e  vehicle  course. 

2. J  Observations 

The  actual  bearings  between  searcher  and  target 
were  then  computed  by 

A3  -  ARCTAN ( (XT-XS) /YT-YS) ) , 

where  XT  and  YT  *  the  cartesian  coordinates  of  the 
target 

XS  and  YS  *  the  cartesian  coordinates  of  tue 
searcher. 

These  actual  bearings  do  r.ot  represent  bearings  that 
realistically  would  be  observed  in  a  noisy  environment. 
In  order  to  model  these  observed  bearings  a  zero-mean# 
Gaussian  noise  component  with  a  standard  deviation  of 
.1  degrees  was  added  to  each  actual  bearing.  The  statis¬ 
tics  of  tnis  observation  noise  can  be  relatively  accura¬ 
tely  predicted  from  the  detection  equipment  used;  it 
implies  an  observation  noise  variance  of  R  -  ,C\. 

2 • 4  Probability  of  Detection  Versus  Range 

Besides  the  large  number  of  measurements  that  must 
be  made,  a  limitation  of  another  kind  is  produced  by 
the  nature  of  the  medium  in  which  sonars  operate.  The 
sea  is  a  moving  medium  containing  inhomogeneities  of 
various  kinds,  toqether  with  irregular  boundaries, 
one  of  vhich  is  in  motion.  Multipath  propagation  is 
the  rule.  As  a  result,  many  of  the  sonar  parameters 
fluctuate  irregularly  with  time,  while  other  change 
because  of  unknown  changes  in  the  equipment  end  the 
platform  on  which  it  is  mounted,  because  of  these 
fluctuations,  a  "solution"  of  the  sonar  equation  is  no 
more  than  a  best-guess  time  average  of  which  is  to 
be  expected  in  a  basically  stochastic  problem,  "recisc 
calculations,  to  tenths  of  decibels,  are  futile.  A 
predicted  sonar  detection  range  is  an  average  quantity 
about  which  the  observed  values  of  range  are  likely 
to  congregate.  Thus,  the  model  supplies  the  probabi¬ 
lity  of  detection  versus  range  for  the  target  of 
interest  to  be  used  on-line  for  target  state  estimation 
and  strategy  formulation  purposes.  The  calculation 
of  a  specific  probability  of  detection  versus  range 
curve  is  included  in  Section  IV  for  use  in  various 
open- loon  simulations. 

III.  The  Estimator 

There  are  many  istimator  designs  in  the  literature 
which  have  been  applied  to  various  search  and  tracting 
problems.  For  the  passive  ocean  search  problem  there 
are  several  factors  which  will  influence  the  selection 
of  the  estimator.  For  a  bearinqs-only  search  and  track 
scenario  the  dynamics  of  vehicle  motion  are  represented 
by  nonlinear  equations.  In  this  case  involving 
imperfect  information,  the  implementation  of  such  an 
estimator  is  by  no  means  easy . 


.1 . 1  Bear  lags -only  Tracking 

The  bearings-onl y  case  of  target  tracking  intro¬ 
duces  some  unusual  problems.  The  many  sources  of  noise 
which  may  affect  recoi- ed  bearing  information  in  the 
ocean  environment  have  already  been  discussed.  Most 
of  the  other  problems  associated  with  bear ii.g-only 
tracking  involve  the  specific  geometries  which  are 
involved.  Convergence  of  filter*?  used  for  bearings- 
onlv  tracking  depends  on  man^  factors.  Target  traject¬ 
ories  involving  long  ranges  and  small  velocities  nay 
make  accurate  solutions  take  longer  to  converge  since 
the  change  in  bearing  with  time  is  quite  small  [5J. 

Another  condition  of  geometry  which  has  been  shown 
to  have  a  very  profound  effect  on  tracking  performance 
in  the  bearings-only  case  is  the  choice  of  coordinate 
system  for  estimator  implementation  [6J .  Since  the 
final  filter  configuration  for  any  specific  problem 
will  ultimately  depend  upon  which  reference  is 

employed  during  problem  formulation,  it  is  not  sur¬ 
prising  to  find  that  Cartesian  coordinates  are  used 
extensively  to  formulate  target  notion  analysis  (TMA) 
estimation  problems  in  the  context  of  the  EKF.  This 
reference  frame  permits  a  simple  linear  representation 
of  the  state  dynamics:  all  system  nonlinearities  are 
embedded  in  a  single  scalar  measurement  equation  t7] . 
Such  a  modeling  structure  is  especially  appealing  for 
practical  application  because  it  minimizes  filter 
computational  requirements. 

Many  non-Cartesian  filters  have  been  used  which 
possess  significantly  different,  and  perhaps  better, 
performance  characteristics  than  their  Cartesian 
counterparts  [81 .  In  fact,  there  is  new  theoretical 
and  experimental  evidence  that  seems  to  indicate  that 
Cartesian  filter  implementations  may  be  unstable  for 
single  sensor  bearings-only  TMA  14)  .  Specifically, 
these  results  indicate  that  unique  interaction  and 
fee<.  oack  of  estimation  errors  within  this  filter  render 
it  highly  iUsceptible  to  premature  solution  divergence 
and  covariar ~e  collapse. 

3.3  Estimator  Architecture 

The  estimator  chosen  to  conduct  bearings-only  TMA 
in  this  application  consists  of  a  three-dimensional 
array  of ^Extended  Kalman  Filters  with  hypothesis 
testing  occuring  along  each  of  the  three  dimensions. 

By  assuming  that  the  bearing  noise  is  zero  mean  and 
Gaussian,  the  FKF  may  be  used  to  provide  estimates  of 
range,  course,  and  speed  euolb  given  bearing  observ¬ 
ations  in  a  large  fraction  of  pos^.\o*o  cases.  The 
problem  is  that  in  the  bearings-only  case,  iniuia! 
course,  speed  and  range  inform**- i nn  is  not  readily 
available  and  in  order  co  an  F.KF  to  be  effective,  it 
must  be  initialized  relatively  o’ose  Lne  actual 
solution.  la  wider  to  overcome  this  problem  we  employ 
the  toctM»-*‘ue  of  hypothesis  Lusting ,  hypothesizing 
courses,  speeds,  and  initial  ranges.  Two  of  the 
dimensions  of  the  array  of  EKF's  will  be  baseu  on 
hypothesized  discrete  courses  and  speeds  which  span 
the  set  of  possible  targets.  The  lifters  will  then 
generate  estimates  of  the  differences  in  course  and 
speed  between  tne  actual  and  hypothesized  values. 

The  third  dimension  of  the  estimator  will  consist  of 
three  initial  range  estimates,  two  based  on  direct  path 
and  one  or  two  based  on  convergence  zone  range  of 
detection  predictions.  Thus  by  observing  only  bearings 
with  noise,  this  array  of  EKF's  will  estimate  ranges 
and  bearings  to  the  target  and  differences  between 
estimated  and  hypothesized  courses  and  speeds  of  the 
target.  By  hypothesis  testing  based  on  the  covariance 
matrices  computed  during  the  tracking  evolution,  some 
filters  will  be  turned  off  and  the  "most  likely" 
filters  to  be  "correct"  will  be  left  running. 
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3. 3  The  Extended  Kalman  Filter 


The  Extended  Kalman  Filter  Equations  are  aa  follows 
[2].  The  state  vector  used  in  both  mean  prediction 
and  Man  update  steps  is 

X  -  iF: 

~  iBt 
lOCi 
iDSi 

where  R  •  target  range  from  tearcher 

B  "  target  true  bearing  from  searcher 
DC  •  delta  course  from  hypothesized  target  course 
CS  -  delta  speed  from  hypothesized  target  speed 

The  reason  for  picking  a  non-Cartesian  state  vector 
have  previously  been  explained.  Finding  the  optimal 
non-Cartesian  coordinate  system  was  not  the  goal  of 
this  effort.  This  coordinate  system  is  used  since  it 
uses  the  same  four  parameters) (range,  bearing,  course, 
and  speed!, that  are  used  operationally  and  thus  are  the 
most  logical  parameters  to  extend  directly  into  strate¬ 
gy  f-rsiulation  without  needing  prior  transformation. 

Since  the  state  dynamic  equations  are  nonlinear,  a 
linear  approximation  to  the  A  matrix  used  for  co- 
variance  prediction  must  be  used.  A  first  order  linear 
approximation  was  used  such  that  the  tow s  of  A  are  the 
partial  derivatives  of  the  predicted  state  with 
respect  to  the  previous  state  vector. 

The  prediction  includes  the  normal  addition  of  un¬ 
certainty  £  to  the  covariance  after  transformation  to 
account  fox  the  dynamics.  An  additional  term,  2.,  is 
necessary  for  tne  nonlinear  covariance  prediction  to 
account  for  higher  order  terms  in  the  Vaylor  Series 
expansion  that  were  neglected  in  obtaining  the  A  matrix. 
As  used  ini  this  application,  £  and  £  were  assumed 
constant  and  an  approximation  to  their  sum  was  obtained 
empirically.  In  order  to  find  this  sum  a  series  of 
sample  runs  was  conducted  with  the  searcher  in  various 
opening  and  closing  scenarios  representing  the  full 
range  of  searcher-target  aspects.  A  circumnavigation 
of  the  moving  target  vehicle  by  the  searcher  was 
primarily  used  since  it  exhibited  a  wide  variety  of 
angles  of  Encounter.  First,  an  estimat  of  each 
diagonal  tirm  in  the  £  matrices  was  made  that  was 
about  two  orders  of  magnitude  below  thr.  initial  errors 
to  be  expected  in  practice  and  used  in  the  simulations. 
With  these  lvalues  serving  as  an  initial  baseline  from 
which  to  start,  runs  were  made  while  varying  these 
values  one ; at  a  time  three  orders  of  magnitude  above 
and  below  these  original  values. 

From  these  initial  tests,  a  first  approximation  of 
the  "best"  value  for  each  of  the  four  diagonal  terms 
was  determined  by  comparing  the  diagonal  element:  of 
the  updated  covariance  matrices  generated  with  the 
actus’  errors  on  each  state  squared.  The  criterion  was 
to  pick  Q  values  which  caused  the  computer  filter 
covariance  matrix  to  best  match  the  magnitudes  of  the 
squares  of  the  Ftate  errors.  These  new  values  of  Q 
served  as  a  more  precise  baseline  from  which  each  of 
the  values  of  £  were  varied  again.  By  the  fourth  base¬ 
line  of  £  values,  the  £  matrix  sum  was  within  an  order 
of  magnitude  of  the  "best"  value  on  each  diagonal  term 
and  the  covariance  terms  were  within  a  factor  of  three 
of  the  squared  state  error  terms.  The  final  value 
obtained  from  thie  iterative  optimization  procedure 
was 

C  ♦  ft  "  f  .0001  1 

_  .00031 

I  .0301 

L  .00001 

where  a  typical  resulting  filter  covariance  matrix  and 
squared  state  error  matrix  were 
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Again,  the  non-diagonal  elements  of  the  £  matrix 
were  assumed  to  be  small  and  were  neglected. 


3.4  Hypothes’s  Testing 

All  of  the  computations  thus  far  have  assumed  a 
prior  knowledge  of  the  target's  course  and  speed.  This 
information  is  provided  by  hypothesizing  various  combi¬ 
nations  of  course  and  speed,  one  for  each  Extended 
Kalman  Filter.  Initially,  twelve  courses  and  twelve 
3peeds  were  hypothesized  to  test  the  filter.  The  hypo¬ 
thesized  courses  started  of  000  degrees  true  with 
increments  every  30  degrees  and  the  speeds  were  at 
increments  of  4  knots  starting  at  0  knots.  As  the 
final  Implementation  including  the  sume  of  the  Q  matrix 
terms  became  clear,  a  review  of  the  filter's  ability 
to  distinguish  errors  in  course  and  speed  was  conducted 
to  determine  if  this  initial  discretication  of  nypo- 
thesized  courses  and  speeds  was  valid. 

A  criterion  of  having  the  hypothesized  values 
spaced  no  more  than  four  times  as  far  apart  as  the 
average  filter  estimate  errors  seemed  to  ensure 
adequate  estimator  filter  overlap.  Fiom  the  test  runs 
conducted  with  the  appropriate  £  matrix  sum  terms, 
average  absolute  errors  in  couise  and  speed  of  7.5 
degrees  and  1.5  knots  were  indicated.  Therefore  the 
final  hypothesized  courses  were  taken  every  30  degrees 
in  a  circle  starting  at  000  degrees  true  for  a  total 
of  12  different  courses.  The  hypothesized  speeds  were 
taken  at  increments  of  6  knots  from  0  to  42  knots  for 
a  total  of  8  different  speeds. 

The  final  estimator  consisted  of  three  banks  of 
96  Extended  Kalman  Filters  with  all  of  the  expected 
.ourses  and  speeds  dircretized  in  each  bank.  Two  bank- 
of  filters  were  Initialized  at  two  direct  path  ranges 
of  detection.  Two  initial  direct  path  ranges  were 
chosen  since  by  adding  the  range  estimate  error  deter¬ 
mined  to  be  acceptable  for  the  EKF  performance  during 
trial  runs  to  these  two  initial  ranges,  the  ‘■otal 
encompassed  range  band  approximated  the  expected  errors 
on  the  range  prediction  provided  by  the  sonar  equations. 
The  third  filter  was  ini.ializcd  at  the  predicted  r*-ge 
nl  detection  in  Uie  first  convergence  -one.  Since  Luc 
first  convergence  zone  '«  the  narrowest  convergence 
zone,  only  one  inicial  range  was  required  to  match  the 
EKF's  range  error  performance  to  Its  expected  width. 

The  only  reraining  requirement  to  implement  this 
hypothesis  testing  scheme  is  to  pick  some  statistic 
generated  by  each  of  the  288  filters  on  which  a 
decision,  "correct  filter"  or  "incorrect  filter"  could 
be  based.  The  statistic  used  was 

X2  -  R£S2/(R  +  P (2, 2) ) 

where  RES  -  the  residual  as  computed  during  the  mean 
update 

R  >  the  bearing  observation  noise  variance 
P(2,2)  »  the  bearing  covariance  term  from  the 
covariance  prediction. 

For  each  subfilter,  this  computation  would  result  .‘n  a 
(chi-square)  density  function  with  one  degree  of 
freedom  if  the  problem  were  truly  linear  and  Gaussian 
191.  For  the  correct  filter,  the  expiected  value  of 
this  random  variable  is  unity,  indicating  that  the 
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*ctu*l  bearing  errors  are  of  the  mm  magnitude  as 
the  predicted  bearing  errors  plus  the  observation 
noise.  This  statistic  should  have  a  higher  value  for 
all  other  incorrectly  matched  filters.  In  order  to 
smooth  this  performance  test  and  look  at  the  recent 
filter  history,  the  actual  statistic  implemented  was 

x(t)2  •  alpha  •  R£S2/(R  +  P(2,2) ) 

+  (1  -  alpha)  *  x(t-l)2 

where  by  setting  alpha  *  1,  the  most  recent  ten  minutes 
of  performance  were  evaluated.  The  logic  usea  to  turn 
off  diverging  filters  was  set  in  the  range  X*2  (1,5J. 
This  range  of  value  was  chosen  to  obtain  the  "best" 
10-30  filter  solutions  by  the  end  of  the  run. 

IV.  The  Strategy 

4.1  Simulation  Options 

in  comparison  with  decisions  already  made  concern¬ 
ing  model  and  estimator  implementation,  picking  a 
limited  number  of  meaningful  simulation  scenarios  was 
much  more  difficult.  The  clear  desire  in  this  case  was 
to  select  realistic  strategy  options  which  would 
provide  Che  most  "insight  per  run"  and  still  keep  then 
simple  enough  so  as  to  minimize  any  coupling  which 
could  cloud  their  relative  b  nefits. 

One  of  the  most  important  deicisions  a  searcher 
must  make  after  an  initial  detection  is  how  to  maneuver 
so  as  to  acquire  as  much  information  about  the  contact 
as  possible  in  the  shortest  period  of  time.  In  the 
bearingseonly  case, it  is  clearly  important  for  the 
searcher  to  maneuver,  influencing  the  bearing  inform¬ 
ation  received  in  such  a  manner  that  the  set  of  feasi¬ 
ble  target  states  is  reduced  as  much  as  possible. 

Often  such  maneuvers  are  limited  to  course  rather 
than  speed  changes  since  ties  searcher  would  like  to  run 
at  a  speed  whicn  due  to  self-noise  provides  him  the 
highest  acoustic  advantage.  Thus,  a  simplified 
version  of  this  strategy  decision  would  be  at  what 
angles  and  at  what  rate  should  course  changes  occur 
for  optimal  target  tracking. 

A  second  and  slightly  more  obscure  factor  of 
interest  is  the  initial  aspect  of  rncounter  between 
the  target  and  searcher.  At  first  glance,  this  factor 
appears  to  be  a  poor  simulation  variable  rince  under 
most  conditions,  the  searcher  has  very  little  control 
over  the  manner  in  which  he  first  confronts  the  target. 
However,  the  initial  geometry  of  target  detection  is 
important  since  by  studying  tracking  performance  under 
these  different  initial  conditions,  the  more  advan¬ 
tageous  target-searcher  aspects  can  be  determined,  ay 
using  this  information,  the  searcher  can  improve  his 
tracking  performance  oy  ma  muvering  into  a  more  favor¬ 
able  target-seerc'ner  aspect  th-’o,  first  encountered. 

The  factors  already  discussed  are  ixpor  it  prima¬ 
rily  due  to  their  geometries  Another  factor  which  is 
of  particular  importance  due  to  the  complex  acoustic 
environment  already  discussed  is  the  propagation.  path 
by  which  the  signal  is  received.  Based  only  on  bearing 
information,  the  difference  in  the  initial  range  of 
detectior  between  a  direct  path  and  a  convergence  zone 
contact  could  be  a  factor  of  4  times  or  mors.  We  would 
expect  the  most  desirable  strategies  to  be  quite 
different  between  the  different  path  and  convercence 
zone  propagation  cases  w.'  th  range  differences  this 
large.  Therefore,  tne  propagation  path  is  also  includ¬ 
ed  as  one  of  our  simulation  variables. 

4.2  The  Simulation 

The  simulation  consisted  of  a  maneuvering  searching 
unit  and  a  constant  velocity  targe,..  The  simulation 
was  run  using  three  different  target-searcher  aspects; 


1.  initial  detection  on  target's  beam 

2.  initial  detection  45  degrees  off  the  target's  bow 

3.  initial  detection  45  degrees  abaft  the  target's 
beam 

This  discretization  of  relative  aspects  was  chosen  so 
as  to  cover  a  major  portion  of  the  target's  azimuth 
while  holding  the  Increase  In  the  required  number  of 
runs  to  a  factor  of  3. 

The  searcher  pointed  the  initial  contact  bearing 
and  commenced  a  zig-zag  search  on  that  base  course. 
Three  zig  angles  were  used  from  the  base  course. 

1.  30  degrees 

2.  60  degree* 

3.  90  degrees 

In  addition,  three  different  zig  times  were  used  during 
the  simulation. 

1.  maneuver  every  5  minutes 

2.  maneuver  every  10  minutes 

3.  maneuver  every  15  minutes. 

These  maneuver  angles  and  frequencies  were  chosen  since 
they  seem  to  bound  the  most  interesting  range  of  values. 
Each  of  these  27  runs  was  made  under  two  different 
initial  range  conditions.  One  run  wau  made  with  the 
initial  detection  made  at  a  range  predicted  by  direct 
path  acoustic  propagation.  The  second  run  was  made 
with  the  initial  detection  made  at  a  range  predicted 
by  convergence  zone  acoustic  propagation.  The  probabi¬ 
lity  of  detection  and  range  results  from  Section  1 
were  used  for  these  two  propagation  paths. 

The  implemented  target  is  radiating  a  S00  Hz  tore 
at  a  source  level,  of  160  dB  11)  •  The  tr an  miss,  ion 
loss  Is  computed  using  the  sonar  equation  with 
spherical  spreading  out  to  a  transition  range, 

Ro  *  3000  yds,  and  cylindrical  spreading  beyond,  plus  a 
loss  proportional  to  range.  A  noise  background  equiva¬ 
lent  to  that  of  the  deep  sea  in  sea  state  3  on  a  quiet 
searching  platform  implies  NL  »  66  dB.  The  receiver 
characteristics  are  modeled  uslnq  incoherent  energy 
processing  In  a  receiver  band  100  Hz  wide  having  a 
probability  of  detection  equal  to  .5  with  a  .01  false 
alarm  probability.  The  required  receiver  observation 
time  under  these  constraints  is  IS  aecondz.  Using  an 
absorption  coefficient  of  .4  dB/kyd  and  the  passive 
sonar  equation,  trasmission  loss  '■  86  dB. 

A  convergence  zone  with  a  10  dB  gain  centered  on 
30  miles  Is  assumed  with  an  intermittent  contact  pro¬ 
bability  of  .75.  For  ease  of  implementation  a  step 
approximation  to  the  curve  of  Figure  4.1  is  used  with  a 
worst  case  probability  of  .5  for  the  direct  path  case. 
The  absorption  coefficient  for  both  the  direct  path 
and  convergence  zcr.e  races  is  .4  dB/kyd.  The  expected 
direct  path  range  of  detection  is  19  kyds.  The  result¬ 
ing  probability  of  detection  as  a  function  of  range 
curve  used  in  the  simulations  is  depicted  In 
Figure  4.i. 


Range (nautical  miles) 
Figure  1 
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4.3  Si  .-item  Validation 

rhe  most  encouraging  result  was  that  In  all  54 
simulation  runs,  the  filter  that  was  hypothesized  to  be 
the  correct  filter  based  on  the  chi-square  statistic 
was  either  the  "correct"  filter,  000  degrees  true  at 
6  knots,  or  an  adjacent  filter.  The  only  qualifi¬ 
cation  to  this  su.-cess  was  that  the  estimator  was  not 
always  able  to  distinguish  between  the  best  direct 
path  and  convergence  zone  solutions.  However,  this 
inability  to  distinguish  between  the  two  propaga¬ 
tion  paths  is  a  very  real  and  important  problem  which 
is  encountered  in  actual  passive  acoustic  search 
cases. 

The  major  problem  with  the  estimator  resulted  from 
its  instability  at  ranges  less  than  about  3  n.m.  At 
these  relatively  close  ranges  the  non-linear  aspects 
of  the  geometry  tend  to  dominate  and  the  linearized 
covariance  prediction  became  inadequate.  For  this 
reason,  most  of  the  data  for  the  direct  path  cases  was 
taken  after  only  15-30  minutes  into  the  run  while 
ranges  were  still  greater  than  3  n.m.  Althought  the 
convergence  zone  run  data  was  all  taken  after  60  minu¬ 
tes  into  the  run,  the  convergence  zone  had  a  much 
greater  incidence  of  intermittent  contact,  receiving 
bearing  information  only  about  75%  of  the  time.  Given 
these  liminations,  at  is  not  surprising  that  the 
"correct"  filters  in  each  case  were  unable  to  signifi¬ 
cantly  reduce  their  initial  epeed  errors.  In  the 
direct  path  case,  range  errors  were  reduced  in  rela¬ 
tively  short  periods  of  time.  However,  in  the  converg¬ 
ence  zone  runs,  with  initial  range  errors  as  small  as 
3%,  tne  estimator  was  only  able  to  maintain  the  error 
margins  approximately  this  small. 

While  clear  differences  in  filter  performance  with 
respect  to  speed  and  rarg  estimates  were  absent, 
there  was  a  clear  difference  in  filter  estimates  of 
target  course  for  different  initial  target-searcher 
geometries.  In  particular,  the  searcher's  estimates 
of  targer  course  during  the  beair.  aspect  runs  were 
consistently  in  error  by  3-5  times  the  errors  en¬ 
countered  with  the  other  two  initial  geometries. 

This  difference  is  again  quite  consistent  with  actual 
practice  since  a  searcher  presented  the  beam  aspect  of 
a  target  is  in  the  worst  possible  location  to  detect 
relatively  minor  changes  in  course  (or  course  errors) . 

11.  Conclusion 

The  primary  purpose  of  this  project,  to  actually 
build  a  realistic  ocean  acoustic  model  and  estimator 
for  open-loop  search  strategy  formulation,  was 
accomplished.  The  various  simulations  which  were  run 
confirm  the  feasibility  and  performance  of  the  system 
as  a  tool  to  be  used  in  further  strategy  assessment. 
Even  though  any  comprehensive  strategy  formulation 
would  ha ’e  to  be  based  on  using  this  system  on  a  large 
number  of  Monte  Carlo  type  runs,  several  strategy 
concepts  suitable  for  a  more  global  closed-loop  imple¬ 
mentation  came  to  light  with  only  a  limited  number  of 
„  uns . 
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Summary 


Mx^n),  m  zero-neanf  uncorrelated  Gaussian 

additive  noise  lor  the  nth  rath 


—-The  hasic  problem  in  ocean  detection  of  narrowband 
coustic  siqnals  is  studied  for  unsatur^ted  sound  pro- 
aaation,  whereas  considerable  proqress  has  been  made 
owards  modeling  the  more  qeneial  case  of  partially 
aturated  acoustic  detection  process.  Detection  is  de- 
ined  as  occurrinq  whenever  p  ,  the  }short-time  averaoe 
oot  mean  square  pressure  at  the^.jpe£eiver ,  exceeds  a 
pecified  threshold  level  ,o0  .  ^lln  this  paper,  a  two- 
tate,  discrete-time  ftarkov  model  is  first  derived  for. 
.he  unsatunted  ocean  acoustic  detection  process, 
‘losed-form  expressions  for  the  probability  mass  func- 
:ions  (PMF*s)  of  the  number  of  time  steps  separating 
;ither  two  successive  detections  ( interarrival  time)  or 
me  detection  and  the  first  subsequent  "downcroFsing" 


Fxirthermore,  the  envelope  and  the  phase  of  the  total 
siqnal  are  defined  as: 

0  -  (*2  ♦  *2)1/2 

(?) 

*  «  tan-^y/X) 


At  short  ranoes  and  low  frequencies,  or  for  stable 
channels,  the  propaqation  is  said  to  be  unsaturated  and 
the  probability  density  function  (PDF)  of  p  is  Rician 
and  independent  of  tre  number  of  paths  111.  (In  sec¬ 
tion  2  the  distributions  of  p  and  its  phas*.  4  are 
presented)  • 


holding  tine)  are  presented.^  Expressions  for  th* - s, 

lolnt  probability  density  function  (PDF)  of  p  at  two 
lifferent  points  in  time  are  obtained  and  used  to  de- 
:ermine  the  relevant  one-step  transition  probabilities 
>f  the  Harkov  model.  Results  usinq  the  model  for  vari¬ 
es  values  of  it*.  inouc  parameters  are  also  presented 
ind  discussed  .v  Another  Markov  model  is  next  derived 
for  Partially  saturated  narrowband  acoustic  signal  pro¬ 
pagation,  and  closed-form  expressions  for  the  PMF’s  for 
the  relevant  interarrival  and  holdtnq  times  are  presen- 
fed^  The  joint  PDF  of  p  at  two  different  points  in 
time  is  obtained  from  a  rather  oeneral  conditional 
PDF  already  derived  hv  Middleton  and  checked  to  re- 
Juce  to  the  limitinq  cases  of  the  fullv  saturated  and 
jnsaturated  processes.  Future  research  nw*v  include  a 
iirect  derivation  of  thj  paitially  saturated  model, 
comparing  both  models  with  data  available  to  us  as  wall 
as  with  appropriate  versions  of  the  current  memoryless 
"state-of-the-art*  (A,  o)  model,  and  finally  using  each 
model  in  resource  allocation  schemes  for  target  track¬ 
ing,  alonq  the  lines  of  previous  research  of  tho  first 
two  authors . 

1 «  Introduction 


In  general  the  quadrature  components  of  the  enve¬ 
lope  of  a  narrowband  c  c«an  acoustic  multipath  process 
are  given  by  ( l J 

N 

X  *  l  (rncos6n  +  Nx<n))  . 
n*1 

(1) 

N 

v  »  l  <inrinen  *  Nv<-’>)  . 
n*1 

where 

N  *  number  of  independent  paths  between  source 
and  receiver 

rn  ••  the  amplitude  of  the  path 
6n  *  the  phase  of  the  nth  oath 


At  sufficiently  lonq  ranoes  and/or  high  freouen- 
)  cies,  the  propaqation  is  fullv  saturated,  which  means 
l  that  t  #  the  phase  of  p  ,  is  uniformly  distributed 
between  0  and  2w  ,  or  each  path  has  a  phase  tin  that 
is  normally  distributed  with  a  standard  deviation 
>  2*  .  In  this  reqime  when  N  >  4  and  the  single 
path  amplitudes  rn  are  approximately  equal,  phase 
random  multipath  nronaoation  is  obtained.  It  has  been 
found  [3]  that  the  envelope  p  of  a  fullv  saturated 
phase  random  process  obeys  a  Rayleiqh  PDF.  Moreover, 
several  other  statistics  and  joint  PDF’s  for  the  phase 
random  process  have  bee,?  obtained,  and  are  summarized 
in  (fij. 

At  intermediate  ranges,  where  the  siqnal  experien¬ 
ces  enough  perturbations  in  the  channel  so  that  each 
%  can  be  characterized  as  a  Gaussian  random  variable 
b\.t  with  a  standard  deviation  <  2w  ,  partially  satura¬ 
ted  propagation  is  obtained.  The  f “equencv/ranqe 
boundaries  between  the  unsaturated,  partially  satura¬ 
ted,  and  fullv  saturated  reqimes  are  dependent  upon  the 
ocean  dynamics  or  boundary  dynamics  of  the  propagation 
channel,  as  well  as  the  magnitude  of  any  relative 
source -receiver  motion.  Envelope  statistics  for  sig¬ 
nals  in  the  partially  saturated  regime  are  presented  in 
Ml.  An  tine  variance  of  the  single  path  phase  goes  to 
zero,  or  becomes  larqe,  the  first  order  PDF’s  converge 
to  the  unnaturated  and  fullv  saturated  results  resjec- 
r.ively  (11. 

In  previous  publications  (7,Rl  of  the  authors, 
continuous  and  discrete- time  detection  models  using  the 
results  of  phase  random  acoustic  propaqation  (3,61 
have  been  formulated.  "Detection*  was  defined  as  an 
uncrossing  of  random  variable  p  (the  root  mean  square 
pressure  at  the  passive  sonar  receiver)  over  a  speci¬ 
fied  threshold  pp  .  A  continuous -time  model  was  first 
developed  for  obtaining  the  PDF’s  of  the  tine  between 
two  successive  detections  (interarrival  tine)  and  of 
the  tine  between  a  detection  and  the  first  sunseouent 
downcrossina  through  pp  (holding  time).  The  model 
w«s  then  compared  with  the  extensively  used  (\,o)  model 
and  tdth  available  acoustic  data.  This  model  was  seen 
to  exhibit  similar  long-term  behavior  Nit  markedly  dif¬ 
ferent  short-term  characteristics  as  compared  with  the 
(A,o)  model,  a  fact  which  is  due  to  the  memory  of  the 
process.  Comparison  with  data  has  demonstrated,  in 
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■oat  C'sei,  a  significantly  improved  prediction  cape 
bility  over  the  (X,o)  model. 


(5) 


b  ■  proh  (pi  >  Po  I  P2  <  Po> 


Subsequently,  a  two-state  and  a  four-state  dis¬ 
crete-time  ftarkov  detection  model  wr»  developed  and 
closed-form  expressions  for  the  probability  mass  func¬ 
tions  of  the  corresponding  interarrival  and  holding 
times  were  derived.  The  results  obtained  using  the 
latter  models  w*re  favorably  compared  with  both  the 
continuous-time  models  and  the  data,  the  greatest  im¬ 
provement  over  the  conti nuous-time  models  lying  in  the 
much  lover  computational  effort  involved. 


In  the  memoryless  case,  a  +  b  ■  - 1  *  and 

a  -  proh(p2  <  pol pi  >  Po>  *  nroh(p2 

<  OoIpi  <  Po>  *  Prob<p2  <  pn> 

PO  *S  PO 

or  a  m  •  fp(p)  dp  •  1  -  0  ,  — } 


In  this  paper,  discrete-tine  detection  models  are 
developed  for  the  unsaturated  case,  first  for  the  *m<*m- 
oryless*  case  and  then  for  the  qeneral  "memory”  Markov 
case.  A  discrete-tine  detection  model  is  al  o  proposed 
for  the  partially  saturated  acoustic  propagation,  Wh.-cu 
is  shown  to  reduce  to  the  two  linitinq  cases  cl  fully 
saturated  and  unsaturated  processes.  Conclusions  ar.d 
suggestions  for  future  research  form  the  final  section 
of  the  paper. 


with  fp(p>  as  in  (3), 

and  tie  generalized  O-function  defined  in  151  as 

*  5  a2  +  E2 

C>H<a,$'  «  /  E  (-)  dxo  -  ■  — 1  —  Ih-i  (<»£>  dE 

6  a  2 

M  -  1,2,... 


2.  Modeling  the  Unsaturated  Detection  Process 

The  probability  density  functions  for  the  root- 
mean  square  pressure  p  and  its  phase  ,  for  the  un¬ 
saturated  process  are  derived  in  Ref.  1.  The  density 
of  p  is  Rician: 


from  (fi) ,  we  can  proceed  to  evaluate  the  proba¬ 
bility  mass  functions  for  the  interarrlvcl  and  holding 
time.  In  general,  theoe  PMF's  take  the  form  [9i 

PK(k)  »  (1-a)k_1a,  k  ■  1,2,... (holding  time) 


fp(p)  -  — ;  exp  (- 


P2  ♦  *s2. 


PRS 

Io(— 5)  .  0  <  p  <  - 

OfT 


P;(n)  -  -  t(1-h)"-’  -  (1-a)"-»), 


the  magnitude  of  the  constant  signal  vector 


n  »  2, 3, ...(interarrival  time! 


In  the  memoryless  case,  (9)  becomes 


•  s-odified  Bessel  function  of  the  first  kind  of 
order  n 


»n2  -  n°N,2  -  n°N2  .  wher* 


°NX  ,  °Ny  ■  the  variances  of  Mx*n*  *"<1  Ny*n)  , 
respectively,  as  defined  in  the 
introduction 

In  our  previous  Harkov  modeling  of  the  phase  ran¬ 
dom  process  (fully  saturated  sound  propagation),  a  two- 
state  model  and  a  four  state  model  were  developed, 

(7,8)  Comparison  with  data  (8)  has  revealed  that  both 
models,  when  properly  calibrated,  yield  verv  satisfac¬ 
tory  results,  the  two-state  being  consistently  as  ac¬ 
curate  as  the  four-state  model. 

We  will  henceforth  restrict  ourselves  in  develop¬ 
ing  a  tvo-stute  Markov  model  for  the  unsaturatedl  pro¬ 
cess  of  the  qeneral  fon«  shown  in  Figure  1.  \ 


I ( 1-a)n“1  -  a»-l]  ,  n  -  2,3,... 


In  the  non-trlvlal  case  (a*MII  the  calculation 
of  the  transition  ptohahiiitles  requires  knowledge  of 
the  joint  density  function  fp,p-(Pl«P2>  •  *i, 
second-order  density  has  already  been  derived  in  a 
rather  general  form  hv  Middleton's  (91  treatment  of 
the  statistical  properties  of  additive  narrowband  sig  ¬ 
nal  and  normal  noise  processes. 

Using  Middleton's  results  and  after  extensive 
algebraic  manipulations,  we  obtain: 

r  ,  ,  PlP2  ,  P,Z  *  p22  , 

fPlP2  'p1'P2>  ”  — - . —  exp  f-  . — ) 

gnw(1-r02)  2gN2(l-r02) 


,  1 
—  (vTw>  ' 


\ 


Figure  1,  TVo-state  di-crete-time  Harkov  model 

where  "0"  *  "up"  state,  defined  by  p  > 

"D*  *  "down"  state,  defined  by  p  <  p 
a  «  proh  (p2  <  po  I  Pi  >  Po) 


•  r0  PlP2  *0Pl 

n-o  PM2(1-roz>  ow?(1+ro2) 

*0P2 

Tm  {—rr — -)  <ii) 

O(jz(1+ro> 

where  e  ■  1  ,  em»2,m«1,2,... 

It  is  reasonable  to  expect  that  (11)  will  renuce,  for 
t  *  *  ,  to  the  product  ^Pi*p1^  •  ^P2^P2^  . 

Since  rr,  *  0  for  t  ♦  »  (uncorrelatedness) ,  (11) 
gives: 


- 


•  ••  •  V  .*  , 


AV-' 


/ 


«*r 


P,?2  p12  ♦  P22  +  2Afl2 

fPlP2  <p’*p2'  *  T-  «*P  (-  - - — 5 - ) 

V  20f,2 

\,0,  V)p2 

io  (— r)  1(3  (— J-) 

°N2  «HZ 


nee  Im(x)  « 
x*0 


(x/2)B  0,  m  *  0 

— — — »  V 

"I  1,  «l  ♦  0 


.  (12)  can  he  rewritten  aa 


‘l'p2  *p',p2*  *  — —  exp  (~ 


p12  ♦  *02  ftOp2 


■)  *0  C~2*)  — 5 

dHZ  <JNZ 


-)  *o  (-r) 


•e.  ^pi'pa  (Pl*P2^  *  ^p<'Pl>  •  Vpg*p2' 


P21  *  1  -  ~ 
pu 


P11  *  1  -  p12  >  p22  *  1  “  p21  (23) 

where  Pu  is  the  (unconditional)  probability  of  p 
being  lea3  than  po  .  efforts  to  simplify  the  evalua¬ 
tion  of  ^  in  Rb.  (20)  were  not  successful.  The 
double  numerical  integration  of  a  function  involving 
the  Infinite  sum  of  products  of  three  modified  Bessel 
functions  was  expected  to  and  did  actually  produce 
computational  prohlems  (excessive  CPU  time).  These 
were  partially  alleviated  uslnq  the  asymptotic  proper¬ 
ties  of  the  Bessel  functions  involved,  in  determinino 
the  tolerances  employed  in  terminatlno  the  evaluation 
of  the  infinite  summations.  Still,  for  extreme  (that 
is,  too  small  or  too  larqe)  detection  thresholds,  the 
computational  effort  is  unaceeptahly  laroe.  However, 
this  is  not  expected  to  be  a  problem  in  practice, 
since  we  do  not  iv  ed  to  use  such  extreme  thresholds  - 
in  fact,  they  result  in  memoryless  Markov  Models  and 
the  problem  does  not  exist,  since  the  evaluation  of 
the  relevant  one-step  transition  probabilities  re¬ 
quires  the  knowledqe  of  lust  the  unconditional  distri¬ 
butions  . 

Implementing  the  Model 


e  can  now  proceed  to  evaluate  the  one-step  transition 
robabilities  of  the  Markov  model. 

PO  *  • 

,2  -  •  -  /  /  fp,P2<Pl-P2>  dp1  dp2  /  /  fp  (p)dp 

0  Po  PO  Oh) 

po  po  po 

21  -  b  -  /  /  fp,P2<P1.P2>  dp1  dP2  /  /  fp  (p)dp 

00  0  07) 


ind  Pn  »  1  -  Pi  2  «  p22  *  1  *  P21  •  Oft) 

die  double  integrals  in  (16)  and  (17)  can  be  evaluated 
is  functions  of 

p0  PO  „  ,  ,  .  . 

Li"/  /  fPlP2  <p',p2>  dPldP2  (.3) 

0  0 

Mthouqh  )y  is  symmetric  with  respect  to  p|  and 
P2  ,  it  cannot  be  expressed  as  a  product  of  one  func¬ 
tion  of  p |  and  one  of  p2  .  Instead,  we  can  rewrite 
(20),  taking  (11)  into  account,  as  follows! 

.  fp°  P1  ,  p|2 _ 

o  on'O-po2)  2qn2M-po2!  U(:2Otpo) 


fp0  ,  p2  ,  -  _  ,  PoPlP2  , 

/  p2  vxp  (-  2  , ,  ? i  t  7m  (  2 i ' 

0  20|j2(1-poz)  m«0  o«jz<1-P0Z* 

A0P,  *0p2  , 

*■  ( — Jr- - r:)  W  (-57? - 57)  dp2  dpi  (2°> 

O;,2(1+P0Z)  0{JZ(  1+P0Z  ) 

In  the  above,  Aq  is  identical  to  Rs  of  eon.  (3). 

Having  evaluated  ^  ,  eqns.  (16)  and  (17)  can  be 
expressed  as  t 

-  h 


Hie  Markov  model  for  the  unsaturated  detection 
process  was  used  with  a  variety  of  -  hopefully  appro¬ 
priate  -  inputs  for  the  parameters  Involved,  namely 
Of!2  ,  v  ,  Rg  ,  T  and  pg  .  No  comparisons  of  our 
predictions  with  real  acoustic  data  that  could  be 
appropriately  modeled  as  unsaturated  sound  propagation 
are  presented.  However,  we  do  have  such  data  at  our 
disposal  and  we  are  planninq  to  use  them  to  that  ef¬ 
fect  (see  section  4). 

Fias.  3-4  present  typical  results  using  of1z  « 

I .SR  ,  Rs  •  2.23  ,  (Rs2  «  5.)  V  •  .2Hz  ,  a  time  step 
of  .4  sec  and  thresholds  p0  »  1.5R,  2.37,  e-d  3.16. 

In  Fias.  3  and  4,  and  for  the  p0  «  3.16  threshold 
only,  hoth  the  histogram  and  the  "outline*  (i.e.,  the 
continuous  line  passing  through  the  too  midpoint  of 
each  her  in  the  histogram)  of  the  corre*-  oi.dinq  BMP 
were  drawn.  For  the  other  two  thresholds,  only  the 
"outlines"  were  plotted,  since  including  their  histo¬ 
grams  wauld  most  probably  reduce  the  legibility  of 
these  figures.  Of  these  values,  o«,2  and  RK  were 
picked  from  an  unsaturated  example  (1)  and  the  rest 
were  chosen  by  the  authors  and  are  more  or  less  arbi¬ 
trarily.  It  is  clear  that,  although  the  density  of 
the  holdinq  time  is  very  sensitive  to  the  magnitude  of 
the  detection  threshold,  the  density  cf  the  Intereri- 
val  time  is  much  less  threshold  dependent.  This  re¬ 
minds  us  of  idiat  we  would  get  in  a  pure  sinusoidal 
signal  situation,  where  ws  have  a  constant  interarri¬ 
val  time  (egual  to  the  period  of  the  sinusoidal  sig¬ 
nal)  but  different  holdinq  times  for  each  threshold 
( Figure  2). 

Such  a  result  was  not  observed  in  cur  previous 
study  of  the  detection  process  using  the  phase  random 
model  for  ocean  propagation  [8! .  In  the  unsaturated 
case,  p(t)  is  obviously  net  a  strict  sinusoid  and 
hence  ws  do  not  get  the  above  4  -  function  densities 
for  the  interarrival  and  holding  times.  Still,  Fig.  4 
shows  that  tha  timing  of  datection  svents  (i.e.  the 
distribution  of  the  lnteratrlval  times)  is  almost  in¬ 
dependent  of  the  detection  threshold.  This  threshold 
makes  its  presencs  felt  only  in  the  distributions  of 
the  holdinq  time,  in  which  we  quite  obviously  have 
shorter  noldlng  times  for  hiqher  thresholds. 

Figures  5  and  6  demonstrate  the  relative  insen¬ 
sitivity  of  ths  above  results  to  changes  in  the  time 
step  T  .  Figs.  9  and  10  correspond  to  T  ■  .8  sec., 
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or  twice  that  of  Flqa.  3  and  4  with  which  they  have 
all  other  inputs  in  common. 


Finally,  Fins.  7  and  8  present  the  results  ob¬ 
tained  usinq  the  unsaturated  model  with  Rg  -  0.0  and 
other  inputs  as  in  Fine.  3-4.  Typical  unsaturated 
propagation  Involves  a  larqe  Rg  (constant  vector) 
and  additional  Gaussian  noise,  and  the  Rg  »  0  case 
could  he  perhaps  more  “naturally*  described  by  a  phase 
random  model. 


Fiqure  2.  Sinusoidal  Siqnal,  Interarrival  and  Holdlnq 
Time  PDF's 


Fiquie  5.  Holdinq  Time,  pg  •  3. IS,  T  «  .4  sec., 
Rs  -  2.23 


9.  .12. 

t (sec) 


Fiqure  3.  Holdlnq  Time,  pg  »  1.58,  2.23,  3.16  , 
T  -  .4  sec.,  Rg  *  2,23 


0.  5.  10.  15.  20. 


40.  50. 

t(sec) 


Fiqure  6.  Interarrival  Time,  pg  »  3.16,  T 
Rg  •  2.23 


3.  6.  9.  12.  15.  18.  21.  24. 


Interarrival  Time,  Pg  »  1.58,  2.2i,  3. 16  , 
T  -  .4  sec.,  Rg  -  2.23 


9.  12. 

t (sec) 


Fiqure  7.  Holdinq  Time,  pg  -  1.58,  T  »  .8  see 


Rs  -  2.23 

3 .  Modeling  the  Partially  Saturated  Detection  Process 


7l2p1 

°N2(1'k02> 


721P2 

Tm  - T7T—7-2.  cos  n  <*21-'t'32-*-*0) 

on2<i-kn2> 

(25' 


with  ♦  not  explicitly  appearing  on  the  riglit  hand 
aide,  and 

~  ti2  “  t*l2+k02A22“?k0A1k2co9f£2  “  E1  ♦  ♦0>11/2 
|hoh2slnf£2  +  ♦<>)  "  Ai&ineil 

-  <|12  ■*  tan~ *  |  —  —  -  ...  - —  —1  ,[ 

|AlCOSCi  -  k{)A2COs(E2  +  *0)1 

“  <(>21  nod  Y21  are  obtained  by  Interchanging 
subscripts  1  and  2 

~  *2^*1 *A2'e1 >e2'  *  exp{-[A1?+  A22  -  2kpAiA2 
eos( e2  -  I,  +  $0)J/2ofl2(1  -  kn2)} 

-  to  *  tan-,[Xo(t)/po(t)l 


Following  the  notation  of  the  previous  section, 
the  first-order  PDF  of  p  for  1  partially  saturated 
process  has  been  derived  [11  and  Is  of  the  form. 


P  |  -1 


2»dxgy(1-pxy2)l'2 

(texp - — 

I  2( 1-0xy  ) 

P2+2|)x2  2pxy|ix|iy 

P24*2Uy2  1 

-  -  .  t  - 

2  X  ax°y 

20y2  | 

?»  -1 

/  exp  -  ■■■ . [a  cosf 2+)  ♦  b  sln(2+)  +• 

0  2(1 -p,y2) 

c  cos  (4)  d  sin( 4) )  d$  (24) 

where* 


-  ko  -  (r02(t)  +  X02<t))l'2 

-  r„(t)  -  WNc^c^/pn2 

-  X0(t)  »  K(NC  N.  )/gN2 

1  2 

-  en  «  uyt  -  <»(  t>  ) 

-  “d  *  “c  *  “0 

Since  a  partially  saturated  process  essentially 
consists  of  an  additive  narrowband  signal  with  Gaus¬ 
sian  noise  superimposed,  (25)  should  serve  as  our 
second-order  density  [121. 

Necessary  conditions  for  this  to  he  true  is  that 
the  second-order  partially  saturated  density  should 
reduce  to  the  densities  of  the  limiting  cases  of  fully 
saturated  and  unsaturated  processes. 


a  ■  p2(1/2ox2  -  1/2oy2) 


2  p*y 

OxOy 

2pxyliy 

2ux 

°x°y 

Ox2 

2pxyuy 

2py 

°x°y 

Oy2 

Fcr  a  memoryless  Markov  model  tor  the  unsaturated 
process,  (24)  is  adequate.  for  the  general  (memory) 
case,  however,  we  need  to  know  the  joint  PDF  fPiP2 
(pl«P2)  •  This  is  given  by 

fp.02*Pl'P2)  ■*  /  fPiP2(Pi  ,02!$!f$(*)d4  ,  where 

* 

f(Pl#P2l*l  conditioned  upon  a  general  set  of 
parameters  is  obtained  in  [9)  as: 


Three  assumptions  are  necessary  to  reduce  Tb .  (2) 
to  the  unsaturated  case:  1,  the  signal  is  an  ensem¬ 
ble  of  unmodulated  sinu-solds  (then  A^  .  I2  ”  N)  , 

2«  *c  “  *0  *  (thus  tdtf  •  0)  ,3.  the  noise  spec¬ 
trum  is  symmetrical  about  fc  >  f0  .  It  then  follows 
that: 

X0(t)  «  0  ,  ko  »  ro(t)  ,  and  $(,  «  0 
and  the  following  simplifications  occur: 

£1  "  e2 

B2  «  exp( - Ap2/ [ oN2 ( rp  ♦  1)1) 

*12  “  *21 

712  -  721  -  (A02(1+  r02  -  2r0)ll/2  -  A<,(1  -  r0) 
Finally,  (26)  becomes; 

fp1P2(p1'p2>  “  [p1P2/ON,'<1-£02)l 

exp(-p12+p22)/[2oN2(l-ro2)l)exp(-A02/[oN2(ro+l)))x 


plP2 

fplP2<Pl>P2l*>  ■  — T7 - 77 

gN‘(1-k02) 


-(p12+p22) 

2gN2(1-k02) 


1  EmIro<rOPlP2/<,N2<1”r02>Im(A<>Pl/ON2<rot1  >) 

2m(AOP2/pN2(rO+)))  (26) 


B2(A,,A2.«,.52) 


£  £m  2m 


k0PlP2 

ON2(1-ko2) 


(26)  is  the  second-order  PDF  for  an  unsaturated 
process,  coinciding  with  (26)  of  [121. 

For  reduction  to  the  fully  saturated  case,  two 
assumptions  are  necessary.  Since  observed  S/N  ratios 
for  fully  saturated  samples  are  very  low,  the  signal 


can  ba  nod" lied  aa  pure  noise.  Thus  1)  •  *2  ■  0  • 
Also,  the  noise  spectrum  Is  symmetrical  about  fc  ■ 
fg  ,  hence 

X0(t)  -  0  ,  kg  -  r0(t)  ,  and  4o  “  0  • 

It  then  follows  thatt 

*1  “  *2  *  “d  “  “c  '  >2  ■  '  •  »n<1  Yl2  *  Y21  *  <*■ 

And  since  I„(0)  ■  0  mao,  Im( 0)  •  1  ,  m  •  0  ,  the 
second-order  POP  ,  Equation  (2)  becomes t 

fP.P2(p  1»®2>  *  IPlP2/ON,,<1-r02)>  exp(-tpi2+P22]/ 

1 20(j2 ( 1  -ro2 )  1 )  lot roPi P2/ ®N2 1 1  _r02 )  1 

(2-») 

Equation  (27)  Is  the  second-order  PDF  for  a 
fully  saturated  process,  as  given  in  equations  (9.20) 
of  (9)  and  (68)  of  (3). 

Thus,  we  have  met  necessary  conditions  for  (25) 
to  be  the  second-order  PDF  for  a  partially  saturated 
process.  It  remains  to.  be  seen  whether  tills  1s  also  a 
sufficient  condition. 

Another  necessary  condition  is  that  the  second- 
order  PDF  ,  Equation  (2),  must  reduce  to  the  first- 
order  PDF  .  (24),  when  inteqrated  over  pj  or  pj  . 

4.  Conclusions  and  Future  Research 

In  Oils  paper,  an  analytical  model  for  the  unsa¬ 
turated  acoustic  detection  process  was  presented  and 
probability  mass  functions  for  the  Interarrival  and 
holding  times  were  derived,  the  unsaturated  mode  of 
acoustic  propagation  was  seen  to  exhibit  different 
characteristics  than  the  previously  developed  phase 
random  acoustic  detection  models.  A  major  difference 
between  these  two  modes  of  acoustic  propagation  lies  in 
the  narrower  (for  the  unsaturated  case)  distribution  of 
p  ,  which  approaches  a  normal  density  aa  Rg  qrows 
large,  A  more  striking  difference  lies  in  the  relative 
independence  of  the  interarrival  time  HP  to  the  detec¬ 
tion  thre’  old  pg  for  the  unsaturated  case,  a  property 
not  observed  in  the  fully  saturated  models. 

In  the  third  section  of  this  paper,  a  discrete- 
time  Markov  model  for  the  partially  saturated  detection 
process  was  presented.  The  second-order  PDF  of  p 
has  been  conjectured  to  be  one  presented  in  (9) ,  (ap¬ 
propriately  inteqrated  over  the  conditioning  set  of 
parameters  4).  It  was  also  shown  that  this  PDF  re¬ 
duces,  as  required,  to  the  11ml tinq  cases  of  fully  sa¬ 
turated  and  unsaturated  propagation.  Clearly,  once  the 
joint  PDP  is  obtained,  the  PDFs  of  the  interarrival 
and  holding  times  can  be  readily  derived. 

Several  interesting  extensions  of  the  present  re¬ 
search  could  be  pursued,  the  most  urqent  of  which  mioht 
be  the  comparison  of  both  unsaturated  and  partially  sa¬ 
turated  models  with  those  acoustic  data  records  in  our 
disposal  for  which  the  PDF  of  p  approaches  the 
Rlciar.  or  the  partially  saturated  density  respectively. 

A  relevant  problem  in  this  comparison  is  to 
devise  a  method  for  estimating  the  parameters  of  these 
distributions  from  the  data  (in  the  phase-random  case, 
v  and  02 2  were  estimated  by  fitting  a  normal 
distribution  to  ths  histogram  of  A  ,  where  A  s  10 
1®910P2  and  A  “  dA/dt) .  Once  and  if  the  ability  of 
our  models  to  predict  the  timing  of  detection  events 
ii  demonstrated,  we  can  very  conveniently  emply  them 
in  all  sequential  optimization  alqorithms  appearing  in 
(2,10,11)  for  a  more  rational  allocation  of  acoustic 
sensor  resources  In  target  tracking,  and  also  in  the 


sequential  hypothesis  best inq  alqorithms  of  (10)  that 
will  help  us  in  signal  vs.  noise,  target  vs.  several 
false  tarqets,  data  association,  tarqet  Identification 
and  other  relevant  problems.  No  chanqes  need  to  he 
made  in  the  alqorithms  of  (10)  other  than  substituting 
ths  chase  random  model  for  acoustic  propagation  with 
the  unsaturated  model  developed  in  this  paper. 
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We  d j SCU3S  the  convergence  properties  of  asynchro¬ 
nous  distributed  iterative  optimization  algor  itlims, 
tolerating  communication  delays.  We  focus  on  a  gra¬ 
dient-type  algorithm  for  minimizing  an  additive  cost 
f ■  'fiction  and  present  sufficient  conditions  for  conver¬ 
gence.  We  view  such  an  algorithm  as  a  model  of  adjust¬ 
ment  of  the  decisions  of  decision  makers  in  an  organiza¬ 
tion  and  we  suggest  that  our  results  can  be  interpreted 
as  guidelines  for  designing  the  information  flows  in 
an  organization. 

1.  Introduction 

This  paper  concerns  the  convergence  properties  and 
communication  requirements  of  asynchronous  distributed 
optimization  algorithms,  tolerating  communication 
delays.  The  results  being  presented  may  be  interpreted 
as  pertaining  to  the  performance  of  potential  parallel 
computing  machines.  Alternatively,  an  approach  which 
we  pursue  in  this  paper,  our  results  may  be  viewed  as 
a  description  of  the  adjustment  process  in  a  distri¬ 
buted  organization,  possibly  involving  human  decision 
makers.  Moreover,  it  could  be  maintained  that  the 
mathematical  models  discissed  here,  capture  some  as¬ 
pects  of  the  ever-present  "bounded  rationality"  of 
human  decision  makers  [Simon,  1900]  . 

Our  motivation  is  the  following:  A  bounded ly  ratio¬ 
nal  decision  maker  solving  an  optimization  problem 
(minimize  J(x)),  may  be  viewed  as  an  iterative  optimiza¬ 
tion  algorithm,  whereby  a  tentative  decision  x(n)  is 
made  at  time  n,  and  then  the  decision  is  updated,  in  a 
direction  of  improvement.  For  example,,  we  may  have 

x(n+l)  =  x(n)  -  y  |—  (x(n) ) ,  (1.1) 

which  corresponds  to  the  well-known  gradient  algorithm. 

By  extending  the  above  analogy  to  more  complex  settings, 
an  organization  (or,  at  least,  some  aspects  of  it) 
consisting  of  cooperative ,  boundedly  rational  decision 
makers  may  be  viewed  as  a  distributed  algorithm.  For 
example,  suppose  that  x  is  a  decision  vector  and  that 
the  i-th  decision  maker  is  in  charge  of  the  i-th  com¬ 
ponent  x^  of  x,  which  he  updates  according  to 

x.  (n+1)  =  x.  (n)  -  Y.  ~  (x(n))  .  (I.?) 

1  1  dX . 

1 

if  each  decision  maker  was  to  update  his  part  of  the 
decision  (his  own  component) ,  at  each  instance  of  time 
according  to  (1.2),  we  would  have  a  synchronous 
distributed  implementation  of  the  centralized  gradient 
algorithm.  Synchronous  algorithms  have  been  studied 
in  a  variety  of  contexts  [Arrow  and  Hurwicz,  1960; 
Gallager,  1977]  but  t’*ey  also  have  certain  drawbacks: 

(1)  Decision  maker  i,  in  order  to  update  x. (n)  accord¬ 
ing  to  (1.2),  he  needs  tc  know  x(n),  at  time  n.  This 
requires  that  each  decision  maker  informs  all  others, 

*  Research  supported  by  ONk under  Contract  ONR/N00014-77-C 


r  ;  each  time  instance,  on  the  adjustments  of  his  deci¬ 
sions.  So,  in  some  sense,  the  synchronous  model  re¬ 
quires  "a  lot  of  communications." 

(2)  A  second  drawback  of  synchronous  algorithms  is  *hat 
communication  delays  can  introduce  bottlenecks  and  slew 
down  the  algorithm.  In  particular,  the  time  between 
two  consecutive  updates  has  to  be  at  least  as  large 

as  the  maximum  communication  delay  between  any  pair  of 
decision  makers. 

(3)  Finally,  complete  synchronization  is  ct  inly  an 
Unrealistic  model  of  human  organizations. 

For  the  above  reasons,  we  choose  to  study  asynchro¬ 
nous  distributed  versions  of  iterative  optimization 
algorithms,  in  which  decision  makers  do  not  need  to 
communicate  to  every  other  decision  maker  at  each  time 
instance.  Such  algorithms  avoid  communication  over¬ 
loads,  they  are  not  excessively  slowed  down  by  commu¬ 
nication  delays  and  there  is  not  even  a  requirement 
that  each  decision  maker  updates  his  decision  at  each 
time  instance,  which  makes  them  even  moie  realistic. 

2.  General  Properties  and  Convergence  Conditions 
of  Asynchronous  Distributed  Algorithms 

We  now  discuss  the  main  principle  underlying  the 
class  of  asynchroncus  algorithms  which  we  consider:  as 
we  mentioned,  in  Section  1,  for  a  synchronous  algorithm, 
each  decision  maker  needs  to  be  informed  of  the  most 
recent  value  of  the  decisions  of  all  other  decision 
makers.  Suppose  now  that  decision  maker  i,  at  time  n, 
needs  for  his  computations  the  current  value  x.(n/ 

of  the  j-th  component  of  x,  but  he  does  not  know  this 
value.  We  then  postulate  that  decision  maker  i  will 
carry  out  his  computations  as  in  the  synchronous  algo¬ 
rithm,  except  that  (not  knowing  x.(n))  he  will  use  the 
value  of  x.  in  the  most  recent  message  he  has  received 
from  decision  maker  j .  Due  to  asynchronism  and  commu¬ 
nication  delays,  decision  maker  i  will,  in  general, 
use  out-dated  values  of  x^  to  update  his  own  decisions. 

However,  updates  besed  on  out-dated  information  may  be 
substantially  better  than  not  updating  v.c  all.  The 
crucial  questions  which  arise  are:  How  much  out-dated 
information  may  be  tolerated?  How  frequent  should 
communications  be ,  so  that  the  distributed  algorithm 
operates  in  a  desirable  manner? 

Questions  of  this  nature  have  been  addressed  by 
Bertsekas  [1982,1983]  for  the  distributed  version  of 
the  successive  approximations  algorithm  for  dynamic 
programming  and  the  distributed  computation  of  fixed 
points.  We  have  obtained  general  convergence  results 
of  a  related  nature  for  the  asynchronous  distributed 
versions  of  deterministic  and  stochastic  iterative 
pseudo-gradient  [Poljak  and  Tsypkin,  1973]  (or  "descent- 
type")  algorithms.  Some  representative  algorithms 
covered  by  our  general  results  pre  deterministic 
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Equations  (3.8),  (3.9)  together  with 
M  . 

xhn+l)  =  x1  (n)  -  y.  7  X?  (n)  (3. 10) 

1  1  i.  .  ,  X 

3=1 

specify  completely  the  asynchronous  distributed  algo¬ 
rithm  of  interest. 

Let  us  now  assume  that  the  time  between  consecutive 
communications  and  the  communication  delays  are  bounded. 
We  allow,  however,  these  bounds  to  be  different  for 
each  pair  of  processors  and  each  type  of  message: 

ik  ik 

Assumption:  For  some  constants  P  #  Q  , 

n-Pik  <  plk(n)<  n,  V(i,k)6  E,  y  ,  (3.11) 

—  —  n 

n-Q1*  <  qlk(n)<  n,  v(k,i)eE,  v,  (3.12) 

Note  that  we  may  let  p11  *  Q11  =*  0. 

The  followinq  result  states  that  the  algorithm  con- 
ik  ik 

verges  if  P  and  Q  are  not  too  large  compared  to  the 
degree  of  coupling  between  different  subproblems. 
[Tsitsiklis,  1983]. 

Theorem  3.1:  Suppose  that  for  each  i 

f  >  E  K  +  l  l  Kk  (Pik+Qk:l+Pjk*Qkl)  .  (3.13) 

'i  j=l  ^  k=l  j-1  13 

12  M 

Let  z(n)  =  (x,  (n)  ,x.  (n) , . ..  ,x  <n) ) .  Then, 
x  *  n 

lim  (z(n))=0,  y.  .  .  (3.14) 

n-3xi  i>3 

We  close  this  section  with  a  few  remarks: 

1.  The  bounds  provided  by  (3.13)  are  sufficient  for 
convergence  but  not  necessary.  It  is  known  (hertsekas, 
1983y]  that  a  decentralized  algoritlim  of  a  similar  type 
may  converge  in  certain  special  cases,  even  if  the  . 's 

ik  ik  1 

are  held  fixed,  while  the  bounds  P  ,  Q  are  allowed 

to  bo  arbitrarily  large.  So,  the  gap  between  the  suf¬ 
ficient  conditions  (3.13)  and  the  necessary  conditions 
may  be  substantial.  Further  research  should  narrow  this 
gap. 

2.  The  convergence  rate  of  the  distributed  algorithm 
should  be  expected  to  deteriorate  as  the  bounds  P  , 

ik 

Q  increase.  A  characterization  of  the  convergence 
rate,  however,  seems  to  be  a  fairly  hard  problem. 


Towards  Organization.-!  Design 


Suppose  that  we  have  a  divisionalized  organization 
and  that  the  objective  of  the  organization  is  to  mini¬ 
mize  a  cost  J  which  is  the  sum  of  the  costs  J1  faced  by 
each  division.  To  each  division,  these  corresponds  a 
decision  r..aker  which  is  knowledgeable  enough  about  the 
structure  of  the  problem  he  is  facing,  to  the  extent 
that  giver  a  tentative  decision  he  is  able  to  change  his 
decision  in  a  direction  of  improvement.  Moreover,  sup¬ 
pose  that  the  division:,  are  interacting  in  some  way;  that 
is,  the  decision  of  one  decision  maker  may  affect  the 
costs  of  another  division.  Suppose,  finally,  that  de¬ 
cision  makers  regularly  update  their  decisions  taking 
into  account  the  decision;  of  other  decision  makers  and 
the  effeccs  of  their  own  decisions  on  other  divisions. 
Mefs^ies  are  being  exchanged  from  time  to  time  carrying 
the  required  information.  Clearly,  the  mathematical 
model  of  Section  3  may  be  viewed  as  a  model  of  the 
above  situation. 

A  natural  question  raised  by  the  above  described 
situation  concerns  the  design  of  the  information  flows 
withia  the  organization,  r.o  as  to  guarantee  smooth 


operation.  i3ut  this  is  precisely  the  issue  addressed 
by  Theorem  3.1:  the  bounds  may  be  thought  as  quan¬ 

tifying  the  degree  of  coupling  between  divisions:  the 

bo  uni  P*^,  £>i3  describe  the  frequency  of  communications 
and  represents  the  speed  of  adjustment.  Theorem 

3.1  links  all  these  quantities  together  and  provides 
some  conditions  for  smooth  operation,  whereby  com¬ 
munication  rates  are  prescribed  in  terms  of  the  degree 
of  coupling. 

We  may  conclude  that  the  approach  of  Section  3  nay 
form  the  basis  of  a  procedure  for  designing  an  organiza¬ 
tional  structure,  or  -more  precisely-  the  information 
flows  within  an  organization.  Of  course,  Theorem  3.1 
does  not  exhaust  the  subject.  In  particular,  Theorem 
3.1  suggests  a  set  of  feasible  organizational  struc¬ 
tures,  with  generally  different  convergence  rates. 

There  remains  the  problem  of  choosing  a  "best"  such 
structure. 

It  is  also  conceivable  that  the  structure  of  the 
underlying  optimization  problem  slowly  changes  with 
time,  and  so  do  the  bounds  ,  but  in  a  time  scale 

slower  than  the  time  scale  of  the  adjustment  process. 

In  such  a  case,  the  bounds  P^3,  should  also  change. 
This  leads  to  a  natural  two-level  organizational  struc¬ 
ture:  At  the  lower  level,  we  have  a  set  of  decision 
makers  continuously  adjusting  their  decisions  and 
exchanging  messages.  At  a  higher  level,  we  have  a 
supervisor  who  monitors  changes  in  and  accordingly 

instructs  the  low-level  decision  makers  to  adjust  their 
communication  rates.  Note  that  the  supervisor  does  not 
need  to  know  the  details  of  the  cost  function;  he  oi.ly 
needs  to  know  the  degree  of  coupling  between  divisions. 
This  seems  to  reflect  the  actual  structure  of  existing 
organizations.  Low  level  decision  makers  are  “experts" 
on  tne  problems  facing  them,  while  higher  level  decision 
makers  only  know  certain  structural  properties  of  the 
overall  problem  and  make  certain  global  decision,  e.g. 
setting  the  communication  rates. 

Event-Driven  Communications 


We  now  discuss  a  slightly  different  "mode  of  opera¬ 
tion"  for  the  asynchronous  algorithm,  which  has  also 
clear  organizational  implications.  It  should  be  clear 
that  communications  are  required  by  the  distributed 
algorithm  so  that  decision  makers  are  informed  of 
changes  occuring  elsewhere  in  the  system.  Moreover, 
ik  ik 

the  bounds  p  ,  Q  of  Section  3  effectively  guarantee 
that  a  message  is  being  sent  whenever  a  substantial 
change  occurs.  The  same  effect,  however,  could  be  ac¬ 
complished  without  imposing  bounds  on  the  time  between 
consecutive  message  transmission:;:  each  decision  maker 
could  just  monitor  his  decisions  and  inform  the  others 
whenever  a  substantial  change  occurs.  It  seems  that 
the  latter  approach  could  result  in  significant  savings 
in  the  number  of  messages  being  exchanged,  but  further 
research  is  needed  on  this  topic. 


5.  Conclusions 

A  large  class  of  deterministic  and  stochastic  iter¬ 
ative  optimization  algorithms  admit  natural  distributed 
asynchronous  implementations.  Such  implementations 
(when  compared  to  their  synchronous  counterparts)  may 
retain  the  desired  convergence  properties,  while  reduc¬ 
ing  conmunication  requirements  and  removing  bottlenecks 
caused  by  communication  delays. 

We  have  focused  on  a  deterministic  g:  adient-type 
algorithm  for  an  additive  cost  function  and  we  have 
shown  that  the  communication  requirements  depend  in  a 
natural  way  on  the  degree  of  coupling  between  different 
components  of  the  cost  function.  This  approach  addresses 
the  basic  problem  of  designing  the  information  flows 
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gradient* type  algorithms,  as  well  as  stochastic  ap¬ 
proximations  algorithms.  Due  to  space  considerations, 
we  only  discuss  here  the  nature  of  the  results.  Exact 
statements  and  the  proofs  may  be  found  in  (Tsitsiklis, 
1983]  and  in  forthcoming  publications.  Preliminary 
v-rsions  of  these  results  appear  in  [Tsitsiklis, 
Bertsekas  and  Athens,  1983). 

To  discuss  tne  nature  of  the  convergence  conditions, 
we  distinguish  two  cases: 

A.  Constant  Step-Size  Algorithms  (e.q.  gradient 
algorithm) 

For  such  algorithms  it  has  been  shown  that  conver¬ 
gence  to  the  centralized  optimum  is  obtained,  provided 
that  the  time  between  consecutive  cosmunications 
between  pairs  of  decision  makers,  plus  the  communica¬ 
tion  delay,  is  bounded  by  an  appropriate  constant. 
Moreover,  the  larger  the  step-size  (i.e.  the  constant 
yin  equation  (1.2)),  the  smaller  the  above  mentioned 
constant.  The  latter  statement  admits  the  appealing 
interpretation  that  the  larger  the  updates  by  each 
decision  maker,  the  more  frequent  communications  are 
required. 

B.  Decreasing  Step-Size  Algorithms  (e.q.  stochastic 
approximation  algorithms) 

In  this  case,  the  algorithm  becomes  slower  and 
slower  as  the  time  index  increases.  This  allows  the 
process  of  comnunications  to  become  progressively 
slower,  as  well.  In  particular,  it  has  been  shown  that 
convergence  to  the  centralized  optinum  is  obtained 
even  if  the  time  between  consecutive  communications 
between  pairs  of  decision  makers,  plus  communication 
delays,  increase  without  bound,  as  the  algorithm 
proceeds,  provided  that  the  rate  of  increase  is  not 
too  fast. 

3.  A  Distributed  Gradient  Algorithm 


In  this  section  we  consider  a  rather  simple  distri¬ 
buted  algorithm  for  minimizing  an  additive  cost  func¬ 
tion.  Due  to  the  simplicity  of  the  algorithm,  we  are 
able  to  derive  convergence  conditions  which  are  gen¬ 
erally  tighter  than  the  general  conditions  discussed 
in  the  previous  sections.  It  will  be  seen  shortly, 
that  these  conditions  admit  appealing  organizational 
interpretations . 

The  conceptual  motivation  behind  our  approach  is 
based  on  the  following  statement: 

If  an  optimization  problem  consists  of  sub¬ 
problems,  each  subproblem  being  assigned  to 
a  different  decision  maker,  then  the  frequency 
of  communications  between  a  pair  of  decision 
makers  should  reflect  the  degree  by  which  their 
subproblems  are  coupled. 


The  above  statement  is  fairly  hard  to  capture  math¬ 
ematically.  This  is  accomplished,  however,  to  some 
extent,  by  the  model  and  the  results  of  this  section. 

Let  J:  Rm~*R  be  a  cost  function  to  be  minimized 
with  a  special  structure: 


J(x) 

,i 


«  J(x1, . .. 
RH-*R. 


’XM> 


M  : 

l  J  (x  , . 

i-1  L 


"V 


(3.1) 


where  J  :  R”-*R.  So  far,  equation  (3.1)  does  not  im¬ 
pose  any  restriction  on  J;  we  will  be  interestad,  how¬ 
ever,  in  the  case  where,  for  each  i,  J*  depends  on  x^ 

and  only  a  few  more  components  of  x»  consequently,  the 

Hessian  matrix  of  each  J*'  is  sparse. 

We  view  as  a  cost  directly  faced  by  the  i-th 
decision  maker.  This  decision  maker  is  free  to  fix 
or  update  the  component  x^  but  his  cost  also  depends 

on  a  few  interaction  variables  (other  components  of  x) 


which  are  under  the  authority  of  other  decision  makers. 

We  may  visualize  the  structure  of  the  interactions 
by  means  of  a  directed  graph  0(V,E) : 

(i)  The  set  V  of  nodes  of  G  is  V-{l,...,M} 

(ii)  The  set  of  edges  E  of  the  graph  is 

£•{ (i, j) :  depends  on  x^}  (3.2) 

Since  we  are  interested  in  the  fine  structure  of  the 
optimization  problem,  we  quantify  the  interactions 
between  subproblems  by  assuming  that  the  following 
bounds  are  available: 


32J 

32Jk 

3xi3Xj 

1Kij' 

3x.3Xj 

Vxe  rm  , 


(3.3) 


where  (without  loss  of  <*.nerality) 


hi  ^  j,  ’Si  • 


(3.4) 


A  synchronous  distributed  gradient-type  algorithm  for 
this  problem  could  be: 


For  each  (i, j)e  E,  decision  maker  }  evaluates 


xj(n) 


121 

3x. 


(x  (n) ) 


(3.5) 


For  each  (i,j)e  E,  decision  maker  j  transmits  \^(n) 
to  decision  maker  i. 


3.  Each  decision  maker  i  updates  x^  according  to 


xi(n+l)  »  xi(n)  -y^  J  X'(n) 
j«l 


(3.6) 


4. 


For  each  (i, j)e  E,  decision  maker  i  transmits 
x^(n+l)  to  decision  maker  j. 

We  now  consider  the  asynchronous  version  of  the 

above  algorithm.  Let  xi(n)»(x^(n),...,x^(n))  denote  a 

decision  vector  (element  of  RM)  stored  in  the  memory 
of  decision  maker  i  at  time  n.  Wa  also  assume  that 
each  decision  maker  i  stores  in  his  memory  another  vec¬ 
tor  (A^(n) , . . .  ,A*(n))  with  his  estimates  of 


3J~  3J*1 

slT"  '  •  *  •  —  •  Unlike  the  synchronous  algorithm,  we 

3x.  3x. 

do  not  require  that  a  message  be  transmitted  at  each 
time  stage  and  we  allow  communication  delays.  So  let: 
jci 

p  (n)  -  the  time  that  a  message  with  a  value 
of  x^  was  sent  from  processor  k  to 

processor  i,  and  this  was  the  last 
such  message  received  no  later  than 
time  n. 


<lki(n> 


the  time  that  a  message  with  a  value  of 


3j 

3x. 


was  sent  from  processor  k  to  processor 


i,  and  this  was  the  last  such  message 
received  no  later  than  tire  n. 

For  consistency  of  notation  we  let 

ii,  ,  ii 
p  (n)  -  ~ 


q  (n)-n. 

With  the  above  definitions,  we  have: 
_k,  ki. 


x£(n> 

A*(n) 


3J* 

3x. 

X 


(x*(qkl 


Yi.vn  • 

(3.7) 

we  have: 

Vh,  V(k,i)e  E, 

(3.8) 

(n))),  V«,V(i,k)€  E. 

(3.9) 
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in  a  distributed  organization  and  may  form  t.ie  basis 

for  a  systematic  approach  to  organizational  design. 
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ABSTRACT 


The  problem  of  designing  the  allocation  of 
information  proceaalng  taeka  to  organization  members 
mho  Interact  with  the  organization* a  environment  la 
formulated.  Two  intonation  atrategiea  ara  considered 
for  redneing  the  load  on  each  member  while 
accompl iahing  the  overall  taak: 'la)'  creation  of  eelf 
contained  tanks,  and'TC^creation  of  alack  resources. 
The  foner  leads  to  parallel  prceesaing,  while  the 
latter  la  accomplished  through  alternate  processing 
rules.  The  two  basic  r.tratecles  can  be  integrated  _to 
produce  a  wide  van  ty  of  intonation  structures, 

INTRODUCTION 


In  military  organizations  the  ability  to  process 
intonation  in  an  eft iclent  (i.e.,  quick  and  accurate) 
manner  can  be  of  critical  importance.  Time 
constraints  and  Limitations  on  the  availability  and 
eapabilities  of  equipment  and  personnel  may  reduce  the 
rate  at  which  the  organization's  decision  makers  (DM*) 
can  respond  to  intonation  they  receive.  These 
constraints  and  limitations  may  force  the  DNs  to 
become  overloaded  (i.e.,  each  DM  is  assigned  more 
tasks  than  ha  it  able  to  ezecute  in  the  preaeribed 
time  interval,  while  still  maintaining  a  given 
parfenaaoe  level).  Information  reduction  strategies 
may  be  employed  to  avoid  overload.  These  strategics 
are  implicit  in  the  definition  of  information 
structures.  A  methodology  it  presented  in  this  paper 
for  designing  the  information  atructures  for  DMs  who 
comprise  the  boundary  between  an  organizaticn  and  its 
environment.  The  objective  of  these  designs  is  to 
proeess  all  information  received  by  the  organization 
efficiently,  using  the  minimum  number  of  organization 
members . 


The  design  of  organizations  can  be  decomposed 
into  two  interrelated  problems:  the  organizational 
form  problem  in  which  the  information  and  decision 
structures  are  specified,  and  the  organizational 
control  problem  in  which  the  operating  rules  or 
procedures  and  the  monitoring  and  enforcement 
atrategiea  are  determined.  The  manner  in  which 
information  is  received  and  processed  by  organization 
membera  is  e  key  descriptor  of  the  structure  of  an 
organization.  Indeed,  commanication  -  the  exchange  of 
information  and  the  transmission  of  meaning  -  is  the 
very  essence  of  an  organisation.  To  move  from  an 
unorganized  state  to  an  organized  state  requires  the 
introduction  of  eonstralnte  and  restrictions  to  reduce 
diffuse  and  'andon  communications  to  channels 
appropriate  for  the  accomplishment  of  orgsnizational 
objectives  (t  sis)  [11. 

The  organization,  perceived  as  an  open  system 
[2],  internets  with  ita  environment:  it  receives 
signals  or  messages  in  various  forma  that  contain 
information  relevant  to  the  organization's  tasks. 
These  messages  must  be  identified,  analyzed  and 
transmitted  to  their  appropriate  destinations  within 


the  organization.  From  this  perspective,  the 
organization  acta  as  an  information  user. 

How  an  organization  re : elves  signals  from  ita 
environment  has  direct  coaseqaenees  on  the  internal 
structure  of  the  organization  and  on  its  performance. 
The  specific  structure  depends  on  the  nature  and 
characteristics  of  the  signals  that  can  be  received, 
on  the  taak  to  be  performed,  and  on  the  eapabilities 
and  limitations  of  the  individual  members  comprising 
the  organization.  By  considering  only  the  boundary 
between  the  organization  and  the  environment,  a  major 
simplification  occurs:  the  boundary  ; cae If  can  be 
thought  of  aa  a  single  eehelon  of  organization 
members.  Vhile  these  mrmbers  nay  oecupy  different 
positions  in  the  internal  organizational  structure, 
the  relevant  characteristic  1^  that  they  receive 
direot  inputs  from  eourees  outside  the  organization. 
(Figure  1).  In  that  sente,  no  member  is  subordinate 
to  another:  i.e.,  the  membera  eonatitute  a  single 
echelon.  However,  Individuals,  or  groups  of 
individuals,  can  have  very  different  capabilities  and 
limitations  that  reflect,  indirectly,  their  position 
and  function  in  the  organization.  For  example,  they 
may  be  able  to  proeeae  only  certain  classes  of  signals 
(specialization)  or  deal  with  limited  levels  of 
uncertainty.  Since  it  la  important  to  remember  that 
the  single  echelon  may  include  commanders  as  well  *s 
operators  of  monitoring  systems. executives  as  well  aa 
clerks,  the  term  deo  is  ioamaken  has  been  used  to 
describe  ell  members.  i 


Figure  1  Environment  and  Single  Echelon. 

(Decisionmakers  with  hatched  areas 
comprise  the  single  eehelon.) 


TEE  MODEL 

The  single  echelon  receives  data  from  one  or  more 
sources  external  to  the  organization.  Every  6n  unite 
of  time  on  the  average,  each  source  n  generates 
symbols,  signals,  or  messages  xal  frem  its  associated 
alphabet  Z&,  with  probability  pa4,  i.e.. 
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-  P(»m  -  *.t>  »  *>4  .  X, 


1,2, .. . ,p 


■  -  1,2,. ...N 


fktn  T,  the  diaeatloa  Of  *n.  Therefore,  l/&n  It 
the  aeaa  frequency  of  symbol  generation  from  louei  a. 

Tht  task  to  bo  perforaed  It  defined  at  tho 
proeata lag  of  tko  input  tpabol a  x_  bp  tho  single 
ookoloa  to  prooaoo  oatpat  symbols.  It  It  ottaaod  that 
a  apeeifle  ooaplox  tatk  that  aatt  ba  perforaed  eta  ba 
aodalad  bp  N'  task  sources  of  data.  Bather  tkaa 
aoaeideriag  thato  toureet  taparatalp,  oaa  inpertoaree 
a*,  aoapotad  of  thaaa  N*  sourest,  la  eraated.  Tha 
lapat  apabol  x  ’  aap  ba  xapraaaatad  bp  aa  N'- 
dlaaa.loaal  vaetor  with  aaeh  of  tha  sources 
rapraaaatad  bp  a  eoapoaaat  of  thlt  vector,  l.a.. 


a*  *  (*x.*,,....aN,)  i  a' a  X 


To  dataralaa  tha  probability  that  tpabol  at  la 
gsaeratad,  tha  independence  botwaaa  ooapoaaata  aart  ba 


eoasidered.  If  all  i  eoapoaaatt  ara  aataallp 
ladapaadaat,  tkaa  p.  1  It  tha  product  of  tha 
probabilities  that  aael  eoapoaaat  of  a]  takaa  oa  Iti 
respeetive  walaa  froa  Ita  attoelatad  alphabet : 


will  ba  daflaad  whlah  suggest  that  aartala  structures 
will,  la  faet,  ba  aora  effective  tkaa  othara  for 
parforatBg  a  apaolflo  ooaplaa  taak.  For  tha  tppaa  of 
orgaalaatloaa  eoaaldarad,  tha  perforaaaee  of  a  eoaplex 
taak  la  equivalent  to  tha  proeaaaiag  of  laforaatloa, 
where  laforaatloa  la  deflaad  to  be  tha  data  received 
bp  tha  HU  la  tha  eeheloa.  Galbraith  ha  a  argaed  that 
warlatloaa  la  the  aaoaati  of  laforaatloa  (data)  that 
ara  act  proeaaaad  ara  prlaarllp  reapoaalble  for  tha 
warlatloaa  la  orgaalaatloaal  font  [3],  Sack 
warlatloai  ara  largalp  a  raaalt  of  tha  aaeartalatp 
attoelatad  with  a  glwaa  talk.  Uaeertalatp  haa  baaa 
daflaad  to  ba  *tha  dlffareaoa  batweaa  tha  aaount  cf 
data  required  to  perfora  tha  taak  a  ad  tha  aaoaat  of 
data  alraadp  poaaeaaed  bp  tha  organisation*  (31. 

It  will  ba  aataaad  that  tha  aaaber  of  DMa 
aaeataarp  la  tha  eekaloa  la  greater  thaa  oaa.  Tha 
aadarlplag  p-aaise  la  that  ao  W  aloaa  la  able  to 
proeata  tha  repaired  aaoaat  of  data  while 
alaaltaaeoaelp  aehlawlag  tha  r'qaired  performance 
lawal. 

Several  faetort  affaet  tha  tlaa  rtqalrad  for  t  DM 
to  proeata  laforaatloa.  Tha  aaeartalatp  of  tha  lapat 
tpabol  gaaaratad  tad  the  aaaber  of  potalble  lapat- 
oatpat  ratpoataa  are  'wo  of  thaaa  faetora.  Keying  la 
oa  thata  two  faetort,  a  procettlag  tlaa  faaetloa  vhleh 
haa  aa  laforaatloa  latarpratatloa  eaa  ta  Introduced 
(41.  Let: 

a  ba  tha  aaaber  of  eoapoaaatt  tha  a-th  HI 
proeeaaaa 

t*(a)  ba  a  paraaatar  vhleh  la  a  faaetloa  of  the 

aaaber  of  eoapoaaatt  assigned  to  tha  a-th  LM 


N* 


Thaa  all  eoapoaaatt  of  tha  lapat  vaetor  ara  aataallp 
ladapaadaat,  tkla  la  referred  to  aa  balag  of  flaeat 
grala.  la  aaap  situations,  thlt  attaaptloa  It 
aaraallatle.  It  la  aora  coaaoa  to  have  aoaa 
ooapoaaata  probabilistically  dapaadeat. 


o"  ba  a  const sat 

p(tvj)  ba  the  probability  attoelatad  with  tha  J-th 
aleaaat  of  the  k-th  partitloa  vector's 
alphabet . 


-  t*(a)  -  a"  log  pCtj.j) 


If  two  or  aora  eoapoaaatt  are  probabiliatleallp 
dapaadeat  oa  aaeh  other;  bat  at  a  groap  are  aataallp 
ladapaadaat  froa  all  other  eoapoaaatt  of  tha  lapat 
vaetor,  thaa  thata  dapaadeat  eoapoaaatt  eaa  ba  treated 
aa  oaa  taper  eoapoaaat ,  with  a  aaw  alphabet.  Thaa  a 
a  aw  lapat  vaetor,  a.  it  daflaad,  eoaposed  of  tha 
aataallp  ladapaadaat  eoapoaaat a  tad  thata 
sapereoapoaeatt.  Thlt  aaw  a  It  oa  finest  grala. 


If  thia  processing  tlaa  faaetloa  it  averaged  over  all 
poitible  eleaeata  of  tha  lapat  tpabol  to  be  presetted 
[4],  thaa 


*k 


Thlt  aodel  of  tha  toareat  iapliet  spate hr onlxatloa 
betweea  tha  generatioa  of  tha  iadividntl  sonrce 
eleaeata  ao  that  they  aap,  la  faet,  ba  treated  at  one 
lapat  tpabol.  Specif ieallp,  it  it  attaaad  that  the 
■eaa  laterarrival  tlaa  for  aaeh  eoapoaaat  ta  it  equal 
to  A.  It  la  alto  aataaad  that  tha  generation  of  a 
partlealar  lapat  vaetor.  a,,  la  ladapaadaat  of  tha 
apabola  geaoratad  prior  to  or  after  it. 


*(t)  -  e"  J  jltjj)  log  pUfcj)  «) 


la  laforaatloa  theory,  tha  last  t era  of  eqaatioa  «), 
la  daflaad  to  ba  tha  eatropp  B  aaaoeiatad  with  tha 
group  of  eoapoaaatt,  k.  It  followa  that: 


The  DMa  la  tha  tiagle  echelon  and  the  particular 
aequenclag  and  allocation  of  laforaatloa  reeelvtd  bp 
tha  DMt  define  the  orgaaixatioaal  fora.  The  deilgn  of 
atrueturaa  la  which  ao  DM  la  aubordlaata  to  aap  other 
DM  will  depend  upon  tha  eoattrainta  lapoaed  bp  the 
orgaalxatloa.  Two  be sic  prtaitet  of  the  design  aethod 
ara  that  (a)  la  geaaral,  there  ia  ao  bait  wap  to 
orgaaixa  and  (b)  aap  wap  of  organixlng  ia  aot  equally 
effaetiva  with  reipeet  to  perforaing  a  tpeelfio  tatk 
(31.  Characteristics  of  tiagle  echelon  organlxationt 


if  -  t“(a)  ♦  c"  Bj. 


Tha  quantity  i"  it  a  aoaotosie  nondaereating 
fuaetloa  of  tha  auaber  of  eoapoaaatt,  t,  attigaed  to 
tha  a-th  HI.  It  dateribat  tha  ilxad  cost  la  texat  of 
tlaa  required  to  proeeat  aap  coaponent  regardless  of 
tha  aaoaat  of  laforaatloa  eoatalaad  ia  it.  A 
aultiplieativa  faaetloa  of  tha  auaber  of  coapoaeata 


s’VV  VV  ..-vv  V 
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uii|Mi  to  the  a-tk  DM  It  often  >itaw4,  1.*., 


t"(s)  -  t"e  <*> 

o 


*k*r«  t#  is  i  constant. 

Tka  parameter  «  in  (7)  it  assumed  to  bt  t  constant 
(or  otok  DM  considered.  To  compute  thj  average 
processing  tias  (or  DN^  to  process  tks  group  of 
components,  k,  tka  aatrop;  Hj.  aast  first  ba  computed. 
Many  distiaet  groupings  of  the  coaponsnts  of  £  eaa  ba 
constructed  sad  aa  entropy  asst  be  coapatsd  for  each 
of  those  groupings.  For  each  sapareoapoasat ,  x(n) , 
tha  aorraspoadiag  entropy  is: 


■a  w  -  2  pCx(b)  «  x^(a)}  log  (p(x(a)  »  x^n))} 


Siaac  it  hat  beta  ttsaaad  tbat  all  aoapoaaats  ia  tks 
iapat  rector  tra  actually  iadepaadaat,  tlia  entropy  of 
say  sat  of  aoapoaaats  it  equal  to  tks  tna  of  tha 
aatropies  of  each  of  tha  components  ia  the  sat.  i.e.. 

^  J  Ha  (10) 

aiL 

This  rasalts  la  sabstsatisl  savings  ia  ooapatstioa. 
lather  thaa  assdiag  to  eoapata  tbs  probabilities  of 
sack  elaaeat  of  each  alphabet  for  each  distiaet 
groaping  of  components  ia  order  to  eoapata  the 
aasoeiatad  aatropy,  it  ia  only  aaeassary  to  eoapata 
tha  entropy  for  aaeh  component  of  tha  iapat  motor. 

ovmoAD 

Than  ■  U  ia  overloaded,  ha  eaa  reset  ia  oaa  of 
several  vaya.  Ha  aay  daeida  to  radaea  tha  aaoant  of 
data  ha  has  to  proesss  by  either  read only  (rejection) 
or  selectively  (filtarlag)  emitting  data.  Tha  aaoant 
of  data  ha  nay  ba  repaired  to  proeaaa  aay  ba  radaead 
also  by  havieg  it  preprocessed.  He  aay  daeida  to 
radaea  the  aaabar  of  categories  of  discriaination 
(i.e.,  approxlaate  tbe  iapvta)  or  ha  aay  radaea  the 
required  level  of  aeearaey  for  processing  tha  data 
and,  ia  so  doing,  redace  the  aaaber  of  different 
oatpats.  If  these  alternatives  aeea  unsatisfactory , 
ha  aay  decide  to  receive  all  tbe  data  alloviag  queues 
to  baild  ap,  delaying  tha  processing  daring  periods  of 
peak  loads  and  atteaptiag  to  catch  no  daring  tine 
intervals  vhsa  iapat  symbols  are  assigned  to  other 
DMs.  Otherwise,  the  DM  aay  a  imply  choose  not  to 
par fora  tha  task  [3], 

Aa  alternative  to  having  tha  data  praprocassad  is 
tha  eaployaeat  of  aaltipla  parallel  channels  t2],  [<]. 
This  paper  will  foens  oa  tha  asa  of  parallel  channels 
(DMs)  which  guarantee  all  tha  data  received  are 
procasaad  immediately.  Queuing  alec  gaarantees  all 
the  Information  is  processed,  bat  aot  iaaedlately. 
Tha  other  approaches  to  reducing  overload  are 
unappealing  to  tha  organisation  designer  as  a  first 
choice  la  two  ways:  (1)  data  ara  omitted  and  (2) 
waatefal  expenditures  are  incurred  since  the  valaa  of 
tha  data  seat  bat  ao  ased  is  lost. 

The  concept  of  parallel  DMs  is  analogous  to  the 
idea  of  distributed  information  processing  with  each 
DM  performing  a  aebtask.  Than  tha  idea  of  considering 
a  single  echelon  structure  only  is  a  relevant  issue 
for  aay  orgaalsatioa.  Many  studies  hsve  revealed  that 


aa  the  uncertainty  of  tha  tasks  increases,  the 
•flatter*  (i.e.,  more  distribatad)  aa  organisation 
should  be.one  with  respect  to  its  DNe  [21. 

Galbraith  [3]  has  suggested  two  information 
redaction  strategies  for  organisations  to  address  this 
isssaa:  (1)  Creation  of  Self-Coatainad  Tasks,  sad  (2) 
Creatioa  of  Slack  Kssoarces.  Ia  tka  first  strategy, 
tha  original  task  is  divided  iato  a  art  of  snbtaaks. 
This  reduces  (si  the  diversity  of  oatpats  each  DM  ia 
the  organisation  can  produce  aad  (b)  the  Bomber  of 
different  inpats,  since  the  DM  need  only  receive  tha 
inputs  pertaining  to  the  given  subtask.  The  stack 
resource  to  be  considered  for  the  second  strategy  will 
be  time  and  will  involve  a  reductioa  in  the  required 
level  of  performance  with  respvot  to  time.  For 
example,  if  it  eaa  originally  required  to  have  tho 
task  completed  ia  time  &,  this  time  aay  be  ex. ended  to 
some  multiple  of  i. 

Two  types  of  processing  modus  will  be  considered, 
parallel  processing,  which  is  associated  with  the 
first  stratsgy,  and  alternate  processing,  which  ia 
associated  with  the  seeoad  strategy.  These  two  modes 
srs  the  fundamental  strategies  employed  to  guaraatee 
all  data  art  processed  eithout  overloading  any  CM  ia 
the  echelon.  Th«  two  fundamental  modes  can  be 
integrated  ia  variov*  ways  to  as  tc  develop  more 
complex  organisational  forma. 

rAIALLEL  PROCESSING 

In  a  parallel  processing  structure,  partitions  of 
the  input  symbol  are  selected  aad  assigned  to  the  DMs 
ia  the  organisation.  The  group  of  DMs  who,  together, 
proceas  the  entire  input  vector  form  the  single 
echelon.  Each  DM  is  eonstraiaed  to  proceas  a 
partition  of  components  from  those  that  do  aot 
overload  him  (i.e.,  from  those  that  result  in  a  mean 
processing  tine  of  6  or  less). 

Mathematical  programming  ia  aa  appropriate 
modeling  approsch  for  this  class  of  problems.  This 
approach  seeks  *the  optimum  allocation  of  limited 
resources  among  competing  aotivites  under  a  set  of 
eoastrsiats  imposed  by  the  nature  of  the  problem  being 
studied*  [3).  Ia  this  context,  the  components  of  the 
input  vector  correspond  to  the  limited  resources ,  the 
DMs  correspond  to  the  competing  activities  end  the 
constraint  sets  include  processing  time  capabilities 
sad  specialisation  limitations  of  each  DM. 

Individual  components  are  selected  and  assigned 
to  each  DM  in  tha  model.  The  group  of  components 
assigaed  to  the  a-th  DM  defines  the  partition  vector. 
The  conditions  for  seleetiag  end  assigalag  components 
are: 

(a)  every  ecuponeat  is  processed,  aad 

(b)  ao  DM  is  overloaded. 

Let  Tea  be  *  binary  variable  which  equals  one  if  the 
a-tb  component  is  assigned  to  the  a-th  DM.  xero 
otherwise.  To  guarantee  that  every  component  is 
processed  once  sad  only  once,  the  following  set  of 
eoastrsiats  ia  established: 

M 

IM  *  -  1  :  a  -  1.2... ..N  (11) 

u-1 


where 
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T  -  0.1  (12) 


A  utforl  structure  which  links  every  component 
to  every  DM  1*  lion  In  Figaro  2. 


Figaro  2  Parallel  Processing  Stractaro. 

A  DM  will  not  be  owerloaded  if  the  average  tiae 
he  requires  to  prooeea  the  coaponents  assigned  to  Mb 
does  no*,  exceed  6.  The  aeen  processing  tiae  is  given 
by  (eqa.  (7)  and  (8)) 


A  deterministic  strategy  is  one  in  which  the 
ordering  of  the  aseignaent  of  the  input  vectors  to  the 
DNs  is  fixed.  In  order  to  specify  the  optiaal 
information  atrnctnreo  associated  with  this  strategy, 
it  it  necessary  to  determine  sianltaneonsly: 

(a)  the  ainiana  number  of  DMs.  M*.  necessary  to 
process  the  input  vectors  withont  any  one 
being  owerloaded,  and 

(b)  the  frequency,  q^,  with  which  each  of  theae 
DMs  receives  an  input  vector. 


st“  ♦  c*  H 


(13) 


where  a  is  the  number  of  coaponents  assigned  to  the 
DM.  Since  the  coaponents  have  been  aasuard  to  be 
Mutually  independent,  the  entropy  H  it  equal  to  the 
aua  of  the  eatropiee  of  the  a  coaponents.  Since  the 
components  aatigned  to  the  a-th  DM  are  not  known  a 
priori,  a  binary  indicator  variable  Iu  is  introdneed 
which  includes  the  tiae  for  processing  component  *a 
only  if  it  is  assigned  to  the  DM: 

.  t 

l(t"+o"  H  )  if  I  -  1 
(t"+c"  H  )  T  -<  "  (14) 

n  u  I 

(°  “*na-° 

Furthermore : 

N 

f"  -  5  <t"*o"  ■  )  T  <8  l  a  ”  1,2, ... ,M  (IS) 
4e  n  u 

a-1 

i.e.,  I*  anst  be  lest  than  or  equal  to  6  to  guarantee 
the  a-th  DM  ie  not  overloaded. 

The  objective  function  for  this  problea  it  to 
ainiaixe  the  noaber  of  DMs  required  to  process  all  the 
eoaponente  withont  overload.  The  information 

structure  can  be  constructed  froa  'he  optiaal  eolation 
to  this  problea.  The  optiaal  eolation  would  identify 
who  and  how  aany  DMs  are  included  ia  the  echelon  and 
what  subtaaks  they  are  perforaing. 

ALTERNATE  PROCESSING 

In forms t ion  structures  based  on  alternate 
processing  are  appropriate  when  the  input  vector 
eannot  be  partitioned  (i.e.,  when  the  strategy  of 
creating  self-contained  tasks  cannot  be  used  to  avoid 
overload).  This  strategy  involves  the  creation  of  a 
alack  reeonree,  in  this  case,  tiae.  That,  each  DM  is 
given  aore  tine  to  process  tbs  inpat  assigned  to  hia, 
which  introduces  a  delay  strictly  greater  than  6. 
Figure  3  illustrates  alternate  processing. 


Figure  3  Alternate  Processing  Structure. 

A  very  staple  method  for  solving  this  problea 
exists.  The  overload  constraint  reqnirea: 


q^  <  8/t* 


(IS) 


where  T*  is  the  average  tiae  for  the  m-th  DM  to 
process  an  input  vector.  Tithout  any  loss  of 
generality,  the  DMs  aay  be  re-indexed  according  to 
their  processing  tiae  functions  (i.e.,  let  the  first 
DM  be  the  aost  efficient,  and  the  m-th  DM  be  the  leeet 
efficient,  so  that  r  <?*<...<  ?M).  The  other 
constraint  on  ‘se  problea  is  that  all  of  the  data  be 
processed: 

M* 

J  q,  ■  l  (17) 

where  M*  has  yet  to  be  deterained.  The  eolation 
proceeds  by  choosing  DNs  in  order  of  efficiency  aatil: 

A+l  A 

J  »/f*  >1*5  »/*"  (lb) 

n-1  m-1 


If  the  right  hand  side  of  equation  (18)  is  an 
equality,  then  the  ainiana  noaber  of  DMs,  M  , 
necessery  to  process  the  input  vectors  withont 
overload  it  sqnal  to  X,  and 


if  1  1  a  <  X 


otherwise 


(19) 


If  the  right  hand  side  of  equation  (18)  it  a 
strict  inequality,  then  the  ainiana  ncaber  of  DNs,  M*, 
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equal  to  X+l.  BiOitu 
X+t 

J  */*“  >  1  . 

»1 

,  nit  bo  defined  to 

6/f"  -  pm  it  1  <  m  <  X+l 

(20) 

0  oth#rwi»« 

•rt 


fim  s  p/X+1  a  -  1.2 . X+l 

d 


X+l 

p  -  <  ]>  6/?")  -  1  (21) 

m-l 

wot  P*  may  bo  tot  to  ensure  that  all  of  tbo  art 
itional,  to  that  a  cyclical  strategy  can  bo  used. 

A  cyclical  strategy  la  defined  aa  a  atratogy  la 
kick  tbo  ordering  of  the  aaaignaent  of  the  Input 
ectora  to  the  DNa  la  repeated  every  o'  Input  vectora. 
a  the  eaae  that  the  right  hand  a  lie  of  (li)  la  an 
quality  and  at  leaat  one  q,  la  Irrational,  a  cyclical 
trategy  cannot  be  naedi  bat  It  aiay  be  argned  that 
'lace  the  f*  are  sanally  as.  noted  rather  than 
revisely  calculated,  a  cyclical  atrategy  can  alvaya 
a  need.  In  thie  caae.  a* .  the  nunber  of  inputa  in 
ne  cycle,  la  the  lowest  eoanoa  denominator  of  the 
a'e.  The  information  atrncturea  for  a  determinietle 
yclical  atrategy  may  now  be  completely  apecified. 

Define  F  to  be  the  ordered  aet  of  indicea  on  one 
ycle  of  o'  input  vectora:  that  la. 

F  •  (f  I  f  -  1.2. ...o')  (22) 

low  let  F*  be  a  aubaet  of  F  where 


F*  •  (f  a  F  I  input  la  aaaigned  to  DIM 

m  -  1.2.....**  (23) 

fith  the  indicator  defined  aa: 

.  (  1  iff  if*  m  -  1.2 . M* 

*•  -  (24) 

(  0  otherwlee 

the  only  re qci remeat  on  the  aatlgnment  of  Input  to  the 
m-th  DM  la  that: 

p* 

5  _  -  q_  m  -  1.2. ...,M*  (25) 

t-i  •* 

Sic.ce  Input  vectora  only  arrive  once  every  b  time 


unite,  if  la  aaaigned  at  time  t-(k  o'+  f)6,  where 
k»0,l....  determinea  the  number  of  cyclea  that  have 
been  completed  at  time  t.  The  optimal  eolation 
identifies  the  number  and  propertiea  of  the  many  DMa 
who  are  included  in  the  aingle  echelon  and  the 
frequency  with  which  each  DM  reeeivea  aa  input  ayabol. 

DESIGN  METBODOLOGT 

The  modeling  and  analytic  of  the  information 
structures  of  tingle  echelon  organlzationa  lead  to  a 
five  atep  dealgn  methodology.  The  firat  atep  la  the 
analyaia  of  the  propertiea  of  the  inputa  to  the 
organization,  i.o.,  of  the  taak  to  be  performed,  and 
the  mathematical  modeling  of  the  aource  to  that  the 
coavonenta  of  the  aymbol  vector  are  mutually 
independent.  The  second  atep  conelata  of  the 
aeleetion  of  the  appropriate  information  reduction 
atrategiea  in  view  of  the  DMa  that  are  available  to 
the  deaigner.  The  baaie  onea  conaidered  are  parallel 
and  alternate  proceaaing.  Integrated  atrategiea  that 
conalat  of  varioua  combinations  of  alternate  and 
parallel  proeeaalng  are  alto  conaidered.  It  ia  here 
that  the  organization  deaigner’a  understanding  of  the 
taak  to  be  performed  by  the  organization  ia  crucial. 
The  third  step  conaiata  of  the  formulation  of  the 
mathematical  model  that  repreaenta  the  (integrated) 
information  reduction  atrategy  selected  la  the  second 
atep.  All  eonatralata  that  characterize  the  taak,  the 
organization,  and  the  individnal  DMa  are  alto 
ezpreaaed  analytleally.  The  fourth  atep  la  the  poaiag 
of  the  optimlztioc  problem.  Given  the  mathematical 
formulation  of  the  single  echelon  model,  an  objective 
function  mutt  be  defined.  >hile  in  specific  caaea, 
solutions  to  the  optimization  problem  eaa  be  obtained 
from  straightforward  computations.  the  general 
formulation  lends  itself  to  mathematical  programming 
techniques.  In  particular,  generalized  network  (GN) 
formulations  proved  moat  attractive  because  of  the 
efficient  algorithms  whieh  ezlat  to  solve  them  (71. 
Knapsack  problems  and  mined  integer  linear  programs 
can  also  be  considered.  The  fifth  atep  conaiata  of 
ezpresaing  the  solution  to  the  optimization  problem  in 
the  form  of  an  information  atrueture.  The  structure 
is  then  evaluated  to  determine  whether  the  trade-offs 
between  the  number  of  DMa  used  and  the  delays  are 
acceptablei  if  not,  then  the  designer  should  return  to 
atep  2  and  revise  the  proposed  information  reduction 
atrategy. 

Ia  the  aezt  seotioa,  re  ezample  la  presented  that 
illustrates  the  application  of  the  five  atep 
methodology. 

EXAMPLE 

Consider  G  distinct  sources.  each  source 
gene.-atlcg  a  vector  of  signals.  The  taak  ia  such  that 
each  aource  output  has  to  be  processed  intact  (i.e., 
it  cannot  be  partitioned).  There  are  N  DMa  who  can 
rceelve  the  generated  signals  and  none  of  these  DMa 
e. n  process  the  output  of  any  of  the  G  sources  without 
being  overloaded.  A  parallel/alternate  information 
structure  teems  appropriate.  A  generic  form  of  this 
atrueture  is  shown  is  Figure  4. 

Step  1:  TASK  MODELING 

The  aupercomponent  conaiata  of  G  synchronized 
sources  that  generate  vector  signals.  The  mean  signal 
generation  rate  la  &~*.  The  elements  of  the  input 
vector  can  be  partitioned  into  G  seta,  each  aet 
eorreapondlng  to  the  outpot  of  each  of  the  individual 
sources.  This  ia  the  finest  grain  decomposition  of 
the  input. 
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mhmrm  /S\  input  •»***»'  9,n*Mlio«  tin* 

Figure  4  Ptti!lil/Alt<rut<  Information  St  roc  tar*. 
Stop  2:  INFORMATION  REDUCTION  STRATEGY 


0  <  q 

*■ 

<  8/T^ 

g  -  1.2 

n  -  1.2, 

(2?) 

Any  DMs 

that  ar* 

resigned 

input 

for 

processing 

with  zero 

frequency 

are  excluded 

from  th*  singl*  echelon.  Furthermore,  each 
DM  It  allowed  to  nctin  inputs  from  nt  most 
on*  of  th*  0  sources.  Constraints  (28)  and 
(29)  guarantee  these  condition*,  sit**  th* 
binary  variable  T  it  s*ro  nh*n  th*  sr  th  DM 
i*  assigned  th*  oatpat  of  th*  g-th  sonre*. 

0 

J  -  0  -  1  m  -  1.2 . M  (28) 

8-1 


g  -  1.2 . 0| 

(29) 

■  -  1,2,... .M 


Not*  that  while  th*  inputs  from  th*  sources  ar* 
received  by  th*  singl*  *ch*loa  simaltanooasly ,  th* 
oatpats  ar*  not  synchronised.  Indeed,  *aoh  DM 
introdaets  a  different  delays  th*  maximum  delay  it 
given  by  th*  maximum  value  of 


*  (30) 


It  it  assumed  that  ap  to  M  distinct  decisionmakers 
may  b*  used  in  th*  aingl*  echelon. 


over  all  a.  If  thl*  delay  it  anaecaptabla ,  them  nor* 
efficient  DMs  ar*  needed. 


The  decomposition  nf  the  inpat  vector  allow*  for 
th*  parallel  processing  of  th*  signals  generated  by 
th*  6  source*.  No  farther  division  into  sabtasks  is 
possible  Sine*  every  on*  of  tm*  0  sabtasks  arriving 
at  a  r»:e  S"1  cannot  be  processed  by  any  on*  DM 
without  causing  overlosd,  th*  eecoau  information 
redaction  strategy  (creation  of  slack  resources)  mast 
be  w.ed.  Alternate  processing  of  signals  generated  by 
each  source  will  allow  additional  time  for  each  DM  to 
4.  th*  processing  and,  therefore,  overlosd  mny  be 
avoided.  The  resalting  processing  mod*  it  an 
integrated  parallel/alternet*  processing. 


Stem  4:  OPTIMIZATION 

In  this  problem,  where  M  distinct  DMs  ar* 
available,  the  namber  of  decis'onmakers  that  can 
process  th*  Incoming  signals  li  a  reasonable  objective 
function  to  be  minimised.  Since  th*  DMs  do  not  have 
identical  properties  (parameters),  the  problem  cannot 
be  decoupled  into  0  distinct  optimisation  problems, 
even  though  no  decisionmaker  is  allowed  to  process 
signsls  from  more  than  on*  sonre*.  Th*  resulting 
mathematics!  programming  (ME)  problem  mast  be  solved 
algorithmically.  Furthermore,  th*  nonlinearity  of  on* 
of  th*  constraints  makes  it  a  difficult  one  to  solve. 


Step  3:  MATHEMATICAL  MODEL 

(a)  Since  alternate  processing  is  assumed  for  th* 
output  of  etch  source,  the  requirements  thst 
all  signals  be  processed  reduces  to  the 
condition  that  th*  sum  of  th*  symbol 
assignment  frequencies  for  th*  output  of  each 
source,  qJB .  must  be  equal  io  unity,  i.*., 

M 

5  V  “  1  *  “  1,2 . 0  <2«> 

m-1 

(b)  In  order  that  ao  DM  be  overloaded,  th* 
frequency  with  which  each  DM  receives  a 
signal  for  processing  should  be  sufficiently 
low  so  that  hit  mesa  processing  time  does  not 
exceed  th*  effective  mesa  intersrrival  tin*. 
This  condition  becomes: 


Step  5:  INFORMATION  STRUCTURE 


Th*  solution  to  th*  nonlinetr  MP,  .amcly,  th* 
values  of  M  ,  q  ,  and  T  ,  defines  the  information 
structure.  Th*  single  echelon  is  composed  of  only 
those  DMs  for  which  tL*  corresponding  frequ  scy  q  is 
strictly  positive.  Th*  information  structure.  Figure 
4,  specifies  th*  decitionnsker,  the  group  components 
L*  h*  processes ,  and  the  frequency  q  with  which  he 
is  assigned  these  inputs. 


CONCLUSION 


An  approach  to  th*  design  of  information 
structures  for  a  singl*  echelon  organization  has  been 
presented.  This  approach  is  based  on  the  properties 
of  th*  inpi ts,  th*  characteristics  of  th*  available 
DMs,  and  th*  constraints  imposed  on  th*  orgsaizstion 
by  th*  task.  Two  basis  information  reduction 
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strategic*,  ereatloa  of  salf-eoatalaad  tasks  sad 
creation  of  alaok  raaoaroa*.  war*  aodelad  ta  parallel 
aad  altaraat*  proeasaiag,  respective'y.  It  was  then 
•kora  tkat  eoaplas  laforaation  stractaraa  can  be 
ooastraoted  asiag  combinations  of  parallel  aad 
alternate  processing.  The  foraer  is  appropriate  when 
am  overall  task  eaa  be  divided  into  sob  task*)  the 
latter,  vhea  delays  ia  producing  an  output  can  be 
tolerated  aad  tka  task  eanaot  be  divided. 

The  aezt  aajor  step  ia  this  research  is  the 
integration  of  tka  single  eehalon  with  other  parts  of 
tka  organisation.  The  single  echelon  is  responsible 
for  traaaaitting  the  processed  inpat*  to  tha 
appropriate  destinations  within  the  organixation.  This 
transmission  of  processed  data  to  otkar  members  in  tha 
organixation  is  referred  to  as  aerial  processing. 

Tha  design  of  anltleehalon  a true  tares  require* 
each  echelon  to  process  its  information  without 
overload.  The  constraints  on  each  echelon,  however, 
most  be  inferred  from  the  constraints  that  are  imposed 
on  the  overall  organixation.  This  introdnoea  a  higher 
level  of  coaqplexlty  to  the  design  problem. 
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INTRODUCTION 
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The  key  assumptions  upon  which  the  development  of 
game  theory  was  initially  based  are  the  following: 

Al\  The  rules  of  the  game  are  common  knowledge 
to  all  the  players  of  the  gate. 

A2;  The  players  have  the  same  perception  (model) 
of  the  game. 

A3^  Play,  a  arc  fully  committed  to  a  priori 
strategies. 

A4)  Players  are  rational. 

|  Assumptions  (A1)-(A3)  are  quite  restrictive  as 

'cj.  they  do  not  hold  In  many  real-life  economic,  political, 
/  military  and  other  social  situations.  This  la  why,  as 
game  theory  developed,  attempts  were  made  to  relax  some 
of  these  assumptions .  .  Assumption  A3  was  a  consequence 
/'of  the  normal! zatlcTTprlnlcple  of  Von-Neuaann,  (12,  pp 
I  79-84]  which  roughly  says  that  given  an  extensive  game, 

I  one  can  always  reduce  it  to  an  equivalent  game  In  nor- 

I  mal  form  Involving  only  strategics  and  payoffs  and 
(  where  all  dynamic  and  Informational  aspects  of  the  ori- 
j  glnal  problem  have  been  expressed  In  the  form  of  stra¬ 
tegies  by  considering  all  the  possible  actions  of  all 
the  players  under  all  possible  circumstances.  Aumann 
and  Maschler  U]  were  the  first  tp  point  out  via  a  sim¬ 
ple  counterexample  the  Inappropriateness  of  the  normcl- 

- lzation  principle  under  certain  conditions ;  since  then 

conslderaele  developments  followed  by  relaxing  the  as¬ 
sumption  of  prior  commitment  (2]-(6].  Harsanyl  (7)  and 
Aumann-Maschler  et  al.  (8]  pointed  out  that  in  some 
military  problems,  players  may  lack  full  information 
about  the  payoff  functions  of  other  players ,  or  about 
the  physical  facilities  and  strategies  of  other 
players,  or  even  about  the  amount  of  information  that 
other  players  have  about  the  various  aspects  of  the 
game  situation.  Thus,  Harsanyl  17]  first  relaxed  as¬ 
sumption  Al  and  formulated  and  developed  models  of 
games  of  Incomplete  Information.  Considerable  progress 
has  been  achieved  in  the  theoiy  of  games  of  Incomplete 
Information  since  Harsanyi's  original  formulation  (see 
18]— ( 10]  and  references  therein). 

'  f  S  ! 

;  .  In  this  paper, twe  shall  telayyssumptlons  Al  and 
A2,  ye~~aKall  modify  assumption  A4,  and  we  shall"  formu- 
/'TaTe-a  claps  of  gazes  which 'we  shall  sail  ^“Subjective 
l  Games'v.  vie  shall  consider  that""fhe  players  have  dif- 
\  ferent  perceptions  (models)  of  the  gene  that  Is  being 
>  .  played.  Vjgrshai*  fuither  assumel^hat  each  player 
J  considers  that  the  other  players  have  models  of  the 

rJ 
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game  which  Is  the  same  as  )ils  own  model.  Moreover,  *^e 
shall  consider-  that  each  player  is  rational  within  his 
own  subjective  view  of  t^ie  decision  problem.  Under 
these  assumptions,  we -shall  study  one-stage  games  -s 
well  as  repeated  games,  hilarious  interesting  issues 
arise  because  of  the  new  assumptions: 


Ql.  How  are  equilibrium^ strategies  defined  for 
subjective  games?  S\ 

Q2.  How  do  these  equilibrium  strategies  lelate 
to  the  equilibrium  stategles  of  the  games 
studied  so  far? 

Q3.  Do  player*  realize  during  the  play  of  the 
game  that  they  have  different  models? 

Q4.  If,  during  the  play  of  the  game,  the 

olayers  realize  that  they  have  different 
models,  how  do  they  modify  their  models 
and  their  strategies? 

Q5.  Does  repetition  of  the  game  result  in  co¬ 
operation  as  in  the  case  of  the  games 
studied  ao  fat  (e.g.,  (11])?  And  does  re¬ 

petition  of  the  game  alleviate  differences 
In  the  models  of  the  players  and  allow  < 

players  to  agree  on  an  equilibrium  strategy? 

Q6.  How  does  bargaining  take  place  In  subjec¬ 
tive  games?  What  is  the  effect  of  the 

_  ba ’'gaining  model  used  or  the  outcome  of 

bargaining? 

07,  Is  it  possible  to  characterize  the  set  of 
al’  equilibria  for  repeated  subjective 
games? 

To  study  and  understand  some  of  these  questions, 
we  shall  consider  a  special  class  of  games,  ntnely  2*2, 
two  person  non-zero  sum  games  of  incomplete  lnfotmatlon 
where  the  payoff  matrices  have  a  special  structure. 

The  rert  of  the  paper  is  organized  as  follows:  In 
Section  2,  we  present  the  model  for  subjective  games; 
specifically,  we  state  the  basic  assumptions  under 
which  the  theory  will  be  developed.  In  Section  3,  we 
briefly  discuss  games  of  Incomplete  information  and 
point  out  the  differences  bewtween  Harsanyi’s  model  and 
our  model.  In  Section  4,  we  consider  static  subjective 
non-cooperative  games  of  incomplete  information;  we 
study  informational  aspects  of  these  games  and  contrast 
the  results  to  those  of  claseical  games.  Conclusions 
are  presented  in  Section  5.  This  paper  is  part  of  a 
larger  report  (16)  where  infinitely  repeated  non- 
cooperative  subjective  games  of  incomplete  Information 
are  studied  and  solved.  The  section  on  repeated  sub¬ 
jective  games  has  been  omitted  due  to  space  limitations 
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The  contributions  of  this  psper  are  as  follows: 

(1)  It  presents  a  oodel  for  subjective  games 
which  relaxes  some  of  the  restrictive  key  assumptions 
of  game  theory  previously  made. 

(11)  It  studies  a  simple  class  of  static  subjective 
games  and  their  informational  aspects;  It  shows  Che  ef¬ 
fect  of  the  subjective  models  on  the  solution  of  the 
game  and  contiasts  the  value  of  information  In  these 
games  to  tho  value  of  information  in  classical  static 
Bayesian  games. 

2.  THE  MODEL  FOR  SUBJECTIVE  GAMES 

We  shall  develop  our  theory  of  subjective  games 
based  on  the  following  key  assumptions: 

51.  Players  have  different  perceptions  (dif¬ 
ferent  models)  of  the  gaae. 

52.  Each  player  thinks  that  the  other  players' 
perception  (model)  of  the  game  la  the  same 
as  his. 

53.  Players  are  Bayesian. 

54.  Each  player  Is  rational  within  bis  own 
subjective  view  of  the  game. 

Assumptions  SI  and  S3  imply  that  each  flayer  J 
will  assign  a  subjective  joint  probability  distribution 
Pj  to  all  variables  unknown  to  him.  A  subjective  prob¬ 
ability  distribution  Pj  entertained  by  player  j  is  de¬ 
fined  in  terms  of  his  own  choice  behavior  and  may  bo 
considered  as  his  personal  estimate  of  the  variables 
unknown  to  him  (13J.  In  contrast  to  SI  and  S3,  assump¬ 
tion  A2  combined  with  Bayesian  players  implies  the 
existence  of  an  objective  probability  distribution  P* 
which  Is  defined  in  terms  of  the  long-run  frequencies 
of  the  relevant  events;  these  frequencies  are  estab¬ 
lished  by  an  independent  observer,  the  umpire  of  the 
game.  j 

I 

Assumption  S2  Implies  that  the  rules  of  the  game 
are  not  common  knowledge  to  all  the  players  1  since  each 
player  thinks  that  the  other  players'  perception  of  tne 
game  Is  the  same  as  his,  yet  this  may  not  bh  true. 
Assumptions  SI  and  S2  were  prevloucly  used  In  the  con- 
te^c  of  distributed  estimation  and  detection  [14[. 

Some  of  the  interesting  Issues  that  arise  because  of  SI 
and  S2  are  the  following: 

11.  How  do  players  Interpret  moves  of  other 
players  or  messages  received  from  other 
players? 

12.  Da  players  realize  during  the  play  of  the 
game  that  they  have  different  models  or  do 
they  play  the  game  and  achieve  an  equili¬ 
brium  which  is  acceptable  within  the  terms 
of  their  own  models? 

13.  If  players  discover  that  they  have  different 

models,  how  do  they  reconcile  their 
differences? 

Some  of  the  above  issues  were  studied  and  answered 
in  I14J  in  the  context  of  distributed  estimation  and 
detection. 

Thus,  comparing  S1-S4  with  A1-A4,  we  see  that  only 
the  rationality  assumption  holds.  Under  assumptions 
S1—S4,  several  interesting  Issues  like  Q1-Q7  discussed 
in  Section  1  arise.  We  shall  study  some  of  these 
Issues  in  the  context  of  a  specific  class  of  games, 
namely  games  of  incomplete  Information. 


3.  GAMES  OF  INCOMPLETE  INFORMATION 

Games  of  Incomplete  Information  (CII)  were  first 
formulated  by  Harsanyi  171.  In  his  original  formula¬ 
tion,  Harsanyl  cal leu  CII  thoie  situations  where  "the 
participants  lack  full  information  about  some  important 
aspects  of  the  game  they  are  olaylng*.  Harsanyl  con¬ 
siders  a  n-player  game  and  makes  the  following  key 
assumptions  In  his  formulation: 

HI.  Each  player  has  certain  Information  about 

the  game.  This  information  is  described  by  a 
vector 

,  T  T.T 

ci  -  [“l.biJ 

The  component  ai  represents  the  private  In¬ 
formation  of  player  1  about  the  payoff  func¬ 
tions  Jl ,  J2»***  ,Jn  <‘f  the  n  players  (the  com¬ 
ponent  bi  will  be  defined  below). 

H2.  In  dealing  with  incomplete  Information,  each 
player  takes  a  Bayesian  approach.  That  Is, 
each  player  assigns  a  subjective  probability 
distribution  Pj, 

P1  “  pl  <ei,  C2,***,ct-i,  ci+i,*”,cn) 

A  P  (ci)  (3-1) 

to  all  the  variables  unknown  to  him  and  at¬ 
tempts  to  maximize  the  mathematical  expecta¬ 
tion  of  his  own  payoff  Ji  in  terms  of  this 
prooablllty  distribution  Pi , 

H3.  The  other  players  do  not  know  the  subjective 
probability  Pj  used  by  player  1.  But  each 
player  j  (j*i)  con  write  Pi  in  the  form 

pi  (c1)  -  R!  (c1^)  (3-2) 

where  Rj  is  a  function  whose  mathematical 
form  Is  known  to  all  players ,  and  bj  is  a 
vector  consisting  of  those  parameters  of  the 
function  Pi  that  are  known  only  to  player  1. 
Thus,  each  player  has  a  probability  distri¬ 
bution  over  the  sub j active  distributions  P* 
of  the  other  players,  and  these  probability 
distributions  are  common  knowledge  to  all  the 
players  since  the  mathematical  form  of  each 
Ri  is  known  to  all  players.. 

Hatsanyl  shows  that  under  H1-H3,  the  GII  is  Bayes 
equivalent  to  a  Game  of  Complete  Information  IGCI)  for 
each  player,  (i.e.,  completely  equivalent  to  a  GCI  for 
each  player  from  a  game  theoretic  standpoint),  if  there 
is  a  probability  distribution  R*(ci,  c2,  •••,  cn)  such 
that 

R*  (ciloi.bt)  -  Rt  (c*|at,bt)  Vi  .  (3-3) 

Furthermore  he  shows  that  the  uormal  form  for  the 
equivalent  Bayesian  game  Is  in  many  cases  an  unsatis¬ 
factory  representation  of  the  game  situation  and  has  to 
be  replaced  by  other  representations,  t.e.,  the  semi¬ 
normal  (or  extensive)  form,  where  Bayesian  games  must 
be  Interpreted  as  games  with  delaved  commitment. 

Thus,  Harsanyl's  formulation  drasdcally  departs 
from  the  standard  formulation  of  game  p-oblems  as  It 
relaxes  two  of  the  basic  assumptions,  namely  A2  and  A3. 

Following  Harsanyl's  original  formulation,  GII 
weie  analyzed  by  many  researchers  (see  [91  -[10)  and 
references  therein),  and  the  sets  of  equilibria  for 
zero  and  non-zero  sum,  static  and  repeated  GII  were 
determined. 
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The  study  of  subjective  ganes  under  assumptions 
S1-S4  will  be  based  on  subjective  G1I.  Our  model  dif¬ 
fers  from  that  of  Harsanyi  in  one  basic  point. 
Harsanyl's  formulation  assumes  that  the  rules  of  the 
game  are  common  knowledge  to  all  the  players  (i.e.,  as¬ 
sumption  A1  holds)  ,  This  happens  because  each  player 
knows  wr.at  information  is  available  to  other  players 
and  he  also  knows  the  functional  form  of  the  subjective 
Joint  probability  distribution  entertained  by  each 
other  player.  Our  formulation  assumes  that  the  rules 
of  the  games  are  not  common  knowledge  to  all  players. 
This  happens  because  each  player  has  his  own  subjective 
probability  about  the  game  and  In  addition  he  thinks 
that  all  the  other  players  share  the  same  view  about 
the  gone  as  he  does.  In  spite  of  this  difference  a  lot 
of  results  from  GII  will  be  useful  In  analyzing  subjec¬ 
tive  GII  and  In  showing  similarities  and  differences 
between  these  two  classes  of  games. 

4.  STATIC  SUBJECTIVE  GAMES  OF  INCOMPLETE  INFORMATION 


B2.  Player  2  thinks  that  chance  selects  game  1 
with  probability 

P(l)  -  q  <  1/2  (4-6) 

B3.  Each  player  thlnka  that  the  other  player* a 
model  of  the  game  Is  the  same  as  his  own 
model 

B4.  Chance  actually  selects  game  1  with 
probability  P(l)  -  r 

We  further  consider  that 

BS.  Each  player  la  rational  within  his  own 
subjective  view  of  the  game. 

Assumptions  Bl-BS  are  the  analogs  of  S1-S4  for  the 
specific  game.  Under  these  assumptions,  we  shall  study 
the  solution  of  the  game  as  well  aa  various  Infor¬ 
mational  aspects  of  It. 


In  this  section,  we  consider  static  non-zero  sum 
games  of  Incomplete  Information.  Ue  present  a  class  of 
subjective  games  with  simple  solutions,  we  study  the 
value  of  public,  private  and  secret  Information,  and  we 
contrast  the  results  with  those  of  the  classical  static 
Bayesian  games. 


4.2  The  Solution  of  the  Game 

We  study  the  problem  formulated  above  under  three 
different  types  of  Information  that  a  player  may  re¬ 
ceive: 


4. 1  Problem  Formulation 

We  consider  the  following  static  twe  person 
r.on-zero  sum  game.  Chance  selects  one  of  two  gases 
with  the  following  payoff  matrices: 


1.  Public  Information 

In  this  case,  both  players  are  Informed  about  the 
outcome  of  the  chance  move. 

2.  Private  Information 


Game 


X  /( C11 .  <=ll) 

“  \(<=12.  C2l) 


T 


(c21i 

(c22- 


(4-1) 


In  this  case,  one  player  (say  player  1)  is  In¬ 
formed  about  the  outcome  of  the  chance  move  where¬ 
as  the  other  (player  2)  Is  not.  Moreover,  the 
uninformed  player  knows  that  his  opponent  la  In¬ 
formed. 

3.  Secret  Information 


Game  2 


(<=12.  <=12) 

,(CU.  c2i) 


We  further  assume  that 


(  c22. 

(c21. 


(4-2) 


cil  >  C21  >  c12  >  <=22 


(4-3) 


In  this  case,  one  player  la  Informed  about  the 
outcome  of  the  chance  move  whereas  the  other  la 
uninformed.  Moreover,  the  uninformed  player  Is 
unaware  that  his  opponent  la  Informed. 

For  each  of  these  games,  we  present  the  solution 
of  the  game.  Moreover,  In  order  to  compute  the  value 
of  Information  for  each  case,  ue  need  to  find  the  solu¬ 
tion  of  the  game  where  no  player  Is  Informed.  Thus,  we 
solve  the  game  for  this  case  too.  For  each  case,  each 
player  perceives  a  game  which  is  different  frou  the 
game  perceived  by  the  other  player.  Therefore,  In  some 
cases,  we  shall  have  to  consider  two  different  games  In 
order  to  find  the  solution  for  each  case. 


<=12  +  c?i  >  cji  +  c22 


(4-4)  1.  Public  Information 


Player  1  can  choose  action  (l,u)  and  player  2  can 
choose  action  o,T. 

Note  that  because  of  4-3  each  player  has  a  dominant 
strateg>  in  each  one  of  the  two  games.  So  far,  the 
statement  of  the  problem  and  the  assumptions  (4-3)  - 
(4-4)  are  essentially  the  same  as  In  115].  However, 
contrary  to  (15],  we  now  assume  that  the  two  players 
have  a  different  perception  of  the  game,  namely: 

Bl.  Player  1  thinks  that  chance  selects  game  1 
with  probability 

P(i)  -  p  >  1/2  (4-5) 


In  this  case,  player  1  plays  1  in  Game  1  and  p  In 
Game  2.  Player  2  play3  a  In  Game  1  and  T  In  Game  2. 
Thus,  the  outcomes  are  (cn,  cn)  in  Game  1  and  (c21, 
C2i)  in  Game  2  and  the  payoff  of  the  players  Is 

jB  -  jB  -  rCjj  +  (i-r)  C2j  .  (4-7) 

2.  Private  Information 

2a.  Assume  at  first  that  player  1  Is  Informed.  Then 
he  plays  o  In  Game  1  and  u  in  Game  2.  Player  2  has  two 
options.  Choose  either  o  or  T.  If  he  chooses  o,  he 
expects  a  payoff  equal  to 

q  cii  +  (1— q)  C22  •  (4-8) 
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If  he  chooses  T  Chen  he  expects  a  payoff  equal  to 

q  cj2  +  (1-q)  C2i  •  (4-9) 

Because  of  (4-4)  and  (4-6),  player  2  prefers  to  play  T. 
Thus,  the  payoffs  of  the  two  players  are 

-  C2i  (4-10) 


J|  ■  rci2  +  (1-r)  C2)  . 


3b.  If  player  2  Is  secretly  Informed  then,  because  In 
each  game  he  has  a  dominant  strategy,  he  plays  a  in 
Game  1  and  T  in  Game  2.  Player  1  acts  considering  that 
the  following  game  Is  being  played: 


C  Pel  l+<l“p)ci2)  *(pcu+(l“p)ci2) 


(pc2i+0~P)c22)  .(pci2+d*P)cn) 


2b.  If  player  2  is  Informed,  then  he  plays  a  In  Game  1 
and  T  in  Game  2.  An  argument  similar  to  thac  of  Case 
2a  ahows  that  if 

C1 1  “  c22 

p  <  -  (4-12) 

cil  +  C21  -  ci2  -  C22 

then  player  1  will  play  p.  Otherwise,  he  will  play  X. 
The  expected  payoffs  for  player  1  are  then: 


JP  »  rci2  +  (1-r)  C21 


( pc!2+(l-p)ci l ) ,(pc2l+( l~p)c22) 


(pc22+<l-p)e2i)  ,fpc22-Kl-p)c2l) 


Because  of  (4-3), (4-5),  X  Is  a  dominant  strategy. 

Thus,  player  1  plays  X,  The  payoffs  of  the  two  players 
are: 


*  rcu  +  (1-r)  C22 


(4-21) 


“  rcu  +  (1-r)  C22 


(4-14) 


respectively. 


The  payoff  for  player  2  la: 


■  C2i  (corresponding  to  u)  (4-15) 


J2  “  C11  (corresponding  to  X)  .  (4-16) 


3.  Secret  Information 

3a.  Assume  at  first  that  player  1  Is  secretly 
Informed  about  the  outcome  of  the  chance  move.  Then, 
because  In  each  game  he  has  a  dominant  strategy,  he 
plays  X  In  Game  1  and  u  In  Game  2.  Player  2  thinks 
that  the  following  game  la  being  played: 


(qcil+(l-q)ci2)  .(qcn+(l-q)ci2) 

(qc2i+(l-q)c22),(qci2-Kl-q)cu) 


(qcj2+(l“q)eil)  .(qc2i+(l-q)c22) 

(qc22+(l-q)c2l) ,(qc22+<1-q)c2l) 

(4-17) 

Because  of  (6-3),  ( i-6 )  each  player  has  a  dominant 
strategy  In  this  game.  Thus,  player  2  plays  r.  The 
payoffs  of  the  two  players  are: 


Jf  -  c2l 


(4-18) 


Jf  -  rci2  +  (1-r)  C21  . 


(4-22) 


Finally,  In  trder  to  compute  the  value  of  Infor¬ 
mation  for  each  case,  we  need  to  solve  the  game  for  the 
case  where  none  of  the  players  Is  Informed  about  the 
outcome  of  the  chance  move.  In  this  case,  player  1 
thinks  that  the  game  whose  payoff  matrix  is  given  by 
(4-20)  Is  being  played,  whereas  player  2  thinks  that 
the  game  whose  payoff  matrix  la  given  by  (4-17)  Is 
being  played.  Therefore,  player  1  plays  X  and  player  2 
plays  T.  The  payoffs  of  the  two  players  are: 


J°  -  rc2l  ♦  (1-r)  C22 


JO  -  rcj2  +  (1-r)  cu 


(4-23) 


(4-24) 


Let  us  discuss  now  some  Interesting  features  of 
the  solutions  of  these  gamer..  At  first  note  that  each 
payoff  bimat r lx  Is  symmetric,  hence  in  each  one  of  the 
two  games,  the  players  are  interchangeable.  Thus,  one 
expects  that  for  the  classical  Bayesian  game.  In  the 
case  of  public  or  secret  Information,  the  behavior  of 
the  Informed  and  the  uninformed  player  will  be  Indepen¬ 
dent  of  who  Is  the  Informed  and  who  Is  the  uninformed 
player.  For  example,  in  the  case  of  private  or  secret 
information  If  player  1  were  the  uninformed  player  and 
played  X  (first  row),  we  would  expect  that  If  the 
situation  were  reversed  and  player  2  became  the  unin¬ 
formed  player  he  would  play  o  (first  column).  Also,  In 
the  case  where  no  player  was  Informed  about  the  outcome 
of  the  chance  move,  the  dominant  strategies  would  be 
(X,o)  or  (u,t),  hence  the  outcome  of  the  game  would  be 
either  (X,o)  or  (vi,t).  Consequently,  the  value  of 
private,  secret  or  public  Information  would  be  the  same 
for  both  players.  It  can  be  easily  checked  that  this 
Is  Indeed  the  case  when-  p-q.r.  However,  this  behavior 
Is  not  observed  when  each  player  has  his  own  subjective 
model  of  the  game.  When  player  1  Is  privately  Informed 
about  the  chance  move,  player  2  always  chooses  t 
(second  column);  on  the  other  hand.  If  player  2  Is 
privately  Informed  about  the  outcome  of  the  chance 
move,  player  1  does  not  always  play  u  (second  row). 

When  player  1  is  the  secretly  Informed  player,  player  2 
always  plays  t  (second  column);  If  player  2  is  the 
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secretly  Informed  player,  player  1  aiv.->vu  Flays  X 
(first  row).  When  no  player  la  informed  about  the  out¬ 
come  of  the  chance  move,  the  outcome  of  the  game  is 
(1,t),  These  facts  indicate  that  the  value  of  private 
and  secret  information  is  now  different  for  each 
player.  (Obviously,  the  asymmetry  is  induced  by  our 
assumption  0  <  q  <  1/2  <  p  <  1.)  Indeed,  this  is  true 
as  we  shall  see  J".  the  next  section  where  we  study  the 
value  of  the  information  for  the  above  game. 


4.3  The  Value  of  Information 

The  value  of  Information  is  defined  in  general  as 
follows: 

Vi  »  (Payoff  of  player  i  when  he  knows  the  out¬ 
come  of  the  chance  move)  -  (Payoff  of  player  i  when 
no  player  is  informed  about  the  oucome  of  the  chance 
move.) 

We  shall  compute  the  value  of  public,  private  and 
secret  information  for  each  player. 

1.  Value  of  public  information. 

Because  of  (4-7),  4-23)  and  (4-24)  we  find  that 


V®  ■  r(cii  -  C2i>  +  (1-r)  (c2i  -  c22)  (4-25) 

V|  *  (2r-l)  cii  +  (1-r)  C2i  -  rcj2  .  (4-26) 


2.  Value  of  Private  Information 

Because  of  (4-10),  (4-14)  -  (4-16),  (4-23)  and 
(4-24)  we  find  that: 


V?  -  (1-r)  (c2i  -  C22)  (4-27) 


V2  “  C21  ”  rc12  “  (1-r)  cii 


cil  ~  C22 

if  P  <  -  (4-28) 

C11  +  c2l  ”  c12  -  c22 


v2  “  r<cU  “  c12> 


cu  -  c22 

if  P  >  -  .  (4-29) 

'll  ♦  C21  ~  c22  -  ci 2 


3.  Value  of  Secret  Information 

Because  of  (4-18),  4-22).  (4-23)  and  (4-24),  we 

get: 


Vf  -  (1-r)  (c2i  -  C22)  (4-30) 


Vf  ’  r(cu  -  ci 2)  .  (4-31) 


Thus,  for  the  class  of  games  considered  in  this 
section,  the  value  of  public,  private  and  secret  infor¬ 
mation  differs  from  pliyer  to  player,  whereas  in  the 
classical  Bavesian  framework  the  value  of  public,  priv¬ 
ate  and  secret  information  does  not  depend  on  who  is 
the  informed  and  who  is  the  uninformed  player. 

Consider  the  case  where  player  2  is  privately 
informed , 

C11  -  C22 

p  <  - 

cu  +  c2i  -  cj2  -  C22 

and 

1 

r  -  -  .  (4-32) 

2 

Then,  the  value  of  information  *or  player  2  is, 
according  to  (4-28), 

■  1  1 

yP  »  C21  -  -  C12  -  -  cil  .  (4-33) 


If 


1  1 

c2i  < cu  +  -  cu 

*.  4 


the  value  of  private  information  fot  player  2  is 
negative.  On  the  other  hand,  in  this  situation,  the 
gain  for  player  1  is,  according  to  (4-13)  and  (4-23) 


1 

-  (c12-c22>  >  0 


(4-34) 


Thus,  for  the  class  of  symmetric  games  considered  in 
this  paper,  we  have  a  case  where  the  value  of  private 
information  is  negative  for  the  informed  player  and  the 
uninformed  player  benefits  from  the  situation!  This 
phenomenon  never  occurs  for  this  class  of  games  in  the 
classical  Bayesian  framework,  where  if  the  value  of 
private  information  is  negative  for  the  nformed  player 
the  uninformed  player  cannot  benefit  either  [15J, 

Even  more  surprising  in  this  case  is  the  fact  that 
the  informed  player  wants  to  use  his  private  informa¬ 
tion,  whereas  the  uninformed  player  wishes  that  the 
informed  player  acted  as  if  he  were  not  informed!! 

The  reason  for  all  these  counterintuitive  results 
and  the  differences  between  the  subjective  game  and  the 
classical  Bayesian  game  is  that  each  player  evaluates 
the  game  as  well  as  the  behavior  of  his  opponent  in  the 
game  in  terms  of  his  own  model  and  acts  accordingly. 
Such  subjective  evaluations  lead  to  behavior  whiih 
would  never  occur  in  the  classical  Bayasian  formulation 
as  evidenced  by  the  previous  analysis. 

One  issue  that  naturally  arises  in  these  games  is 
the  following:  How  do  the  players  involved  in  the  game 
interpret  its  outcome?  Do  they  realize  that  they  have 
different  models?  If  neither  player  is  informed  about 
the  outcome  of  the  chance  move,  then  player  1  plays  the 
game  described  by  (4-20)  and  player  2  plays  the  game 
described  by  (4-17),  In  this  situation,  piayer  1  ex¬ 
pects  that  player  2  will  use  strategy  0  and  player  2 
expects  that  player  1  will  use  strategy  p.  At  the  end 
of  the  game,  each  player  finds  out  that  the  outcome  is 
the  opposite  of  what  he  expected.  Since  each  player 
assumes  that  his  opponent  is  rational,  at  the  end  of 
the  game  both  players  conclude  that  they  have  different 
models.  Similar  phenomena  occur  if  one  of  the  players 
is  either  secretly  or  privately  informed. 


In  the  case  of  secret  Information,  the  secretly 
lnforc.ed  player  discovers  at  the  end  of  the  game  that 
his  opponent's  perception  of  the  game  Is  different  from 
his.  On  the  other  hand,  the  uninformed  player  may  1) 
never  discover  that  his  opponent  has  a  different  per¬ 
ception  of  the  game  or,  2)  not  be  able  to  Interpret  his 
opponent's  move  In  terms  of  his  own  model  In  which  case 
he  can  conclude  that  1)  either  his  opponent  has  a 
different  model  of  the  game  or  (most  likely),  (11)  his 
opponent  has  secret  Information. 

In  the  case  of  private  Information,  the  uninformed 
player  Is  not  in  a  position  to  discover  at  the  end  of 
the  game  that  his  opponent  has  a  different  view  of  the 
ga~e.  The  Informed  player  may  or  may  not  (depending  on 
whether  (4-12)  holds)  discover  at  the  end  of  the  game 
that  he  and  his  opponent  have  Inconsistent  beliefs 
sbout  the  game. 

Note  that  If  both  p,q  >  1/2  or  p,q  <  1/2,  the 
players  never  discover  the  differences  In  their 
models. 


4.4  Summary 

In  this  section,  wc  presented  and  analyzed  a  sim¬ 
ple  class  of  two  person  non-cooperative,  non-zero  sum 
one-stag-  subjective  games  of  Incomplete  Information. 

We  showed  how  the  Inconsistent  beliefs  of  the  players 
lead  to  results  which  are  counterintuitive  and  differ¬ 
ent  from  the  results  of  the  classical  Bayesian  game, 
and  how  private  or  secret  Information  Is  differently 
evaluated  by  each  player. 

One  Important  Issue  that  has  not  been  discussed  so 
far  Is  the  following:  Are  the  differences  between  the 
two  players  amplified  or  smoothed  out  if  the  game  Is 
repeated  over  and  over?  This  Issue  will  not  be  dis¬ 
cussed  here  due  to  space  limitation.  The  analysis  of 
the  repeated  subjective  game  of  incomplete  Information 
appears  In  (16). 


5.  SUMMARY  -  CONCLUSIONS 

In  this  paper,  we  presented  a  formulation  of 
subjective  games.  This  formulation  relaxes  two  of  the 
Hey  assumptions  upon  which  game  theory  was  originally 
developed.  1)  the  assumption  that  all  players  have  the 
same  model  of  the  game  and  2)  the  ass’tmptlon  that  all 
the  rules  of  the  game  are  common  knowledge  to  the 
players.  To  Illustrate  the  differences  between  our 
formulation  and  previous  formulations  of  games,  we 
restricted  attention  to  a  simple  specific  class  of 
games  of  Incomplete  Information.  We  showed  that  even 
for  this  simple  class  of  games,  the  Inconsistent 
beliefs  of  the  players  lead  to  results  which  are 
counterintuitive  and  different  from  the  results  of  the 
corresponding  classical  Bayesian  games. 
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ABSTRACT 


He  study  the  computational  complexity  of  finite  ver¬ 
sions  of  the  simplest  and  fundamental  problems  of  dis¬ 
tributed  decision  making  and  we  show  that,  apart  from 
a  few  exceptions,  such  problems  are  hard  (NP-complete , 
or  worse) .  Some  of  the  problems  studied  ore  the  well- 
known  team  decision  problem,  the  distributed  hypothesis 
testing  oroblem,  as  well  as  the  problem  of  designing 
a  communications  protocol  that  guarantees  the  attain¬ 
ment  of  a  prespecified  goal  with  as  little  communica¬ 
tions  as  possible.  These  results  indicate  the  inherent 
difficulty  of  distributed  decision  making,  even  for 
very  simple  problems,  with  trivial  centralised  counter¬ 
parts  and  suggest  that  optimality  may  be  an  elusive 
goal  of  distributed  systems. 

1.  Introduction  and  Motivation 

In  this  paper  we  formulate  and  study  certain  simple 
decentralized  problems.  Our  goal  is  to  formulate  pro¬ 
blems  which  reflect  the  inherent  difficulties  of  decen¬ 
tralization!  that  is,  any  difficulty  in  this  class  of 
problems  is  distinct  from  the  difficulty  of  correspond¬ 
ing  centralized  problems.  This  is  accomplished  by 
formulating  decentralized  problems  whose  centralized 
counterparts  are  either  trivial  or  vacious. 

One  of  our  goals  is  to  determine  a  boundary  between 
''easy*  and  "hard"  decentralized  problems.  Our  results 
will  indicate  that  the  set  of  "easy"  problems  is 
relatively  small. 

All  problems  to  be  studied  are  imbedded  in  a  dis¬ 
crete  framework:  the  criteria  we  use  for  deciding 
whether  a  problem  is  difficult  or  not  come  from  com¬ 
plexity  theory  [Garey  and  Johnson,  1979 i  Papadimitriou 
and  Steiglitz,  1982] :  following  the  tradition  of  com¬ 
plexity  theory,  problems  that  may  be  solved  by  a  poly¬ 
nomial  algorithm  Jure  considered  easyi  NP-complete,  or 
worse,  problems  are  considered  hard.*  However,  an 
NP-completeness  result  does  not  close  a  subject,  but  is 
rather  as  a  result  which  can  guide  research:  further 
research  should  focus  on  special  cases  of  the  problem 
or  on  approximate  versions  of  the  original  problem. 

The  main  issue  of  interest  in  decentralized  systems 
may  be  loosely  phrased  as  "who  should  communicate  to 
whom,  what,  how  often  etc."  From  a  purely  logical 
point  of  view,  the  first  question  that  has  to  be  raised 
is  "are  there  any  communication  necessary7”  Any 
further  questions  deserve  to  be  studied  only  if  we 
come  to  the  conclusion  that  communications  are  indeed 
necessary. 

The  subject  of  Section  2  is  to  characterize  the  in¬ 
herent  difficulty  of  the  problem  of  deciding  whether 
any  communications  are  necessary,  for  a  given  situa¬ 
tion.  We  adopt  the  following  approach:  a  decentral¬ 
ized  system  exists  in  order  to  accomplish  a  certain 
goal  which  is  externally  specified  and  well-known.  A 


set  of  processors  obtain  (possibly  conflicting)  obser¬ 
vations  on  the  state  of  the  environment.  Each  processor 
has  to  make  a  decision,  based  on  his  own  observation. 
However,  for  each  state  of  the  environment,  only  certain 
decisions  accomplish  the  desired  goal.  The  question 
"are  there  any  communications  necessary?"  may  be  then 
reformulated  as  "can  the  goal  be  accomplished,  with 
certainty,  without  any  cconunications?"  He  show  that 
this  problem  is,  in  general,  a  hard  one. 

We  then  impose  some  more  structure  on  the  problem, 
by  assuming  that  the  observations  of  different  proces¬ 
sors  jure  related  in  a  particular  way.  The  main  issue 
that  we  address  is  "how  much  structure  is  required  so 
that  the  problem  is  an  easy  one?”  and  we  try  to  deter¬ 
mine  the  boundary  between  easy  and  hard  problems. 

In  Section  3  we  formulate  a  few  problems  which  are 
related  to  the  basic  problem  of  Section  2  and  discuss 
their  complexity. 

In  Section  4  we  study  a  particular  (more  structur'd) 
decentralized  problem  -  the  problem  of  decentralized 
hypothesis  testing  -  on  which  there  has  been  soma  in¬ 
terest  recently,  and  characterize  its  difficulty. 

Suppose  that  it  has  been  found  that  communications 
are  necessary.  The  next  question  of  interest  is  "what 
is  the  least  amount  of  communications  needed?"  This 
problem  (Section  S)  is  essentially  the  problem  of  desig¬ 
ning  an  optimal  communications  protocol;  it  is  again  a 
hard  one  and  we  discuss  some  related  issues. 

In  Section  6  we  present  our  conclusions  and  discuss 
the  conceptual  significance  of  our  results.  These  con¬ 
clusions  may  be  summarized  by  saying  that: 

a)  Even  the  simplest  (exact)  problems  of  decentralized 
decision  making  are  hard. 

b)  Allowing  some  redundancy  in  communications,  may 
greatly  facilitate  the  (off-line)  problem  of  desig¬ 
ning  a  decentralized  system. 

c)  Practical  cosraunications  protocols  should  not  be  ex¬ 
pected  to  be  optimal,  as  far  as  minimization  of  the 
amount  of  conmunications  is  concerned. 

Some  of  the  results  of  this  paper  appear  in 
[Fapadir.itriou  and  Tsitsiklis,  1983]  and  (almost)  all 
proofs  may  be  found  in  [Tsitsiklis,  1983], 

2.  A  Problem  of  Silent  Coordination 

In  this  section  we  formulate  and  study  the  problem 
whether  a  sat  of  processors  with  different  information 
may  accomplish  a  given  goal  -with  certainty-  without 
any  communications. 

Let  (l,...,M)  be  a  set  of  processors.  Each  processor, 
say  processor  i,  obtains  an  observation  y.  which  comes 
from  a  finite  set  of  possible  observations.  Then, 

processor  i  makes  a  decision  u^  which  belongs  to  a 
finite  set  U.  of  possible  decisions,  according  to  a 
rule.  1 


*  Research  supported  by  ONR  under  contract  ONR/N00014-77-C-0532  (NR-041-519) . 

*  One  way  of  viewing  NP-complete  problems,  is  to  say  that  they  are  effectively  equivalent  to  the  Traveling 
Salesman  problem,  which  is  well-known  to  be  algorithmically  hard. 
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(2.1/ 


following : 
Theorem  2.1/ 


where  is  some  function  from  Y.  into  U,.  The 
M-tupla  <y  , .... y  )  is  the  total1 information  avail¬ 
able/  so  it  may  be  viewed  as  the  "state  of  the  envi¬ 
ronment. "  For  each  state  of  the  environment,  we  assume 
that  only  certain  M- tuples  (u, , . . .  .u^  of  decision  ac¬ 
complish  a  given,  externally  specified,  goal.  More 
precisely,  for  each  (y1,...,y}f)e  Y^x. .  .xVM  we  are  given 

a  set  S(v.,...,y..)  C  U.x...xU„  of  satisficing  decisions. 

X  n  1  n 

(So,  S  may  be  viewed  as  a  function  from 
Ox. .  .XU 

Y.xY  x. .  .x  Y  into  2  1  ") . 

12  m 

The  problem  to  be  studied,  which  we  call  "distri¬ 
buted  satisficing  problem"  (after  the  term  introduced 
by  H.  Simon  (1980])  may  be  described  formally  as 
follows/ 

Distributed  Satisficing  (PS) i  Given  finite  sets  Y^ , . . ., 
Yjj,  01»...,0M  and  a  function  Si  Yj^  x...x  Y^  ♦ 

O  X...X  UM 

2  1  ,  are  there  functions  3^/  Y^  ■»  U^,  i»l, 

2,...,M,  such  that 

<3i<Yl> . JM(yH,)eS(yl'  •  ” '  V  '  v(yl'  •  •  ■  'V® 

Y^  X. .  .X  Yji  (2.2) 

Remarks: 

1.  Me  are  assuming  that  the  function  S  is  "easily 
computable/*  for  example,  it  may  be  given  in  the  form 
of  a  table. 

2.  The  centralized  counterpart  of  DS  would  be  to 

allow  the  decision  a  of  each  agent  depend  on  the 
entire  set  (y^ . y1)  of  observations/  so,  would 

be  a  function  from  Y^  x...x  Y^  into  U^.  (T.J.S  cor¬ 
responds  to  a  situ  -tion  in  which  all  processors  share 
the  same  information.).  Clearly,  then,  there  exist 
satisfactory  (satisficing)  functions  di:YjX...xYM  -*U^, 

if  and  only  if  Sly^  . . .  Y(yl»  ....l^SY^.  ..xY^ 

Since  S  is  at.  "easily  computable*  set  as  a  function 
of  its  arguments,  we  can  see  that  the  centralized 
counterpart  of  DS  is  a  trivial  problem.  So,  any  dif¬ 
ficulty  inherent  in  DS  is  only  caused  by  the  fact  that 
information  is  decentralized. 

3.  A  "solution"  for  the  problem  DS  cannot  be  a  closed- 
form  formula  which  gives  an  answer  0(no)  or  l(yes). 
Rather,  it  has  to  be  an  algorithm,  a  sequence  of  ins¬ 
tructions,  which  starts  with  the  data  of  the  problem 
(Y^, . . .  ,Y^,  U^, ... ,U^,S)  and  eventually  provides  the 

correct  answer.  Accordingly,  the  difficulty  cf  the 
problem  DS  may  be  characterized  by  determining  the 
place  held  by  DS  in  the  complexity  hierarchy.  For 
definitions  related  to  computational  complexity  and 
the  methods  typically  used,  the  reader  is  referred  to 
[Garey  and  Johnson,  1979/  Papadimitriou  and  Steiglitz, 
19821. 

4.  If,  for  soma  i,  the  set  0^  is  a  singleton,  proces¬ 
sor  i  has  no  choice,  regarding  his  decision  and,  con¬ 
sequently,  the  problem  is  equivalent  to  a  problem  in 
which  processor  i  is  absent.  Hence,  without  loss  of 
generality,  we  only  need  to  study  instances  of  DS  in 
which  |lh ]>.  2, 

5.  We  believe  that  the  problem  DS  captures  the  es¬ 
sence  of  coordinated  decision  making  with  decentral¬ 
ized  information  and  without  communications  (silent 
coordination) . 

Some  initial  results  on  DS  are  given  by  the 


a)  The  problem  DS  with  two  processors  (M"2)  and  res¬ 
tricted  to  instances  for  which  the  cardinality  of  the 
decision  sets  is  2  (|u^|w2,  i-1,2)  may  be  solved  in 

polynomial  time. 

b)  The  problem  DS  with  two  processors  (M*2)  is  HP- 
complete,  even  if  we  restrict  to  instances  for  which 

|U2I“3. 

c)  The  problem  DS  with  three  (or  more)  processors 
(i;>3)  is  NP-complete,  even  if  we  restrict  to  instances 
for  which  |ui|»2,Vi 

Theorem  2.1  states  that  the  problem  DS  is,  in  gen¬ 
eral,  a  hard  combinatorial  problem,  except  for  the 
special  case  in  which  there  are  only  two  processors 
and  each  one  has  to  make  a  binary  decision.  It  should 
be  noted  that  the  difficulty  is  not  caused  by  an  at¬ 
tempt  to  optimize  with  respect  to  a  cost  function, 
because  no  cost  function  has  been  introduced.  In  game 
theoretic  language,  we  are  faced  with  a  "game  of  kind." 
rather  than  a  "game  of  degree." 

We  will  now  consider  sons  special  cases  (which  re¬ 
flect  the  structure  of  typical  practical  problems)  and 
examine  their  computational  complexity,  trying  to  deter¬ 
mine  the  dividing  line  between  easy  and  hard  problems. 
From  now  on  we  restrict,  our  attention  to  the  case  in 
which  there  are  only  two  processors.  Clearly,  if  a 
problem  with  two  processors  is  hard,  the  corresponding 
problem  with  tliree  or  more  processors  cannot  be  easier. 

We  have  formulated  aoove  the  problem  DS  so  that  all 
pairs  (y^,y2)f?  Y^xY^  are  likely  to  occur.  So,  the 

information  of  different  processors  is  completely  un¬ 
related/  their  coupling  is  caused  only  by  the  structure 
of  the  satisficing  sets  S(y.,y  ).  In  most  practical 

situations,  however,  information  is  not  completely  uns¬ 
tructured/  when  processor  1  observes  y^,  he  is  often 

able  to  make  certain  inferences  about  the  value  of  the 
observation  y  of  the  other  processor  and  exclude  cer¬ 
tain  values.  We  now  formalize  these  ideas: 

Definition/  An  Information  Structure  I  is  a  subset 
of  Wo  saY  that  an  information  structure  I  has 

degree  (D^.Dj)  (D^D^  are  positive  integers)  if 
7T5  For  each  y^fiY^  there  exist  at  most  D^  distinct 
elements  of  Y^  such  that  (y^.y^JS  I. 

(ii)  For  each  y^eY^  there  exist  at  most  D^  distinct 
elements  of  Y^  such  that  (y^.Yjld  I. 

(iii)  D^.Dj  are  the  smallest  integers  satisfying  (i) , 

(ii) .  An  information  structure  1  is  called  classical 
if  D^-Dj-l:  nested  if  D^-l  or  D^-l. 

He  now  interpret  this  definition:  The  information 
structure  I  is  the  set  of  pairs  (y^.y^)  of  observations 

that  may  occur  together.  If  I  has  degree  (D^.D^) 

processor  1  may  use  his  own  observation  to  decide  which 
elements  of  Y^  may  have  been  observed  by  processor  2. 

In  particular,  he  may  exclude  all  elements  except  for 
D.  of  them.  The  situation  faced  by  processor  2  is 
sy/msetr  ical . 

If  D^“l  and  processor  1  observes  y  ,  there  is  only 

one  possible  value  for  y  .  So,  processor  1  knows  the 
observation  of  processor  2.  (The  converse  is  true 
when  D^-l) .  This  is  called  a  nested  information  struc¬ 
ture  because  the  information  of  one  processor  contains 
the  information  of  the  other. 

When  D^-D^-l,  each  processor  knows  the  observation 
of  the  other/  so,  their  information  is  essentially 


3.  Belated  Problems 


shared. 

Since  pairr  (y^y.)  not  in  I  cannot  occur,  there  isno 
meaning  in  requiring^ the  processors  to  make  compatible 
decisions  if  (y^.Yj)  were  to  be  ODserved.  This  leads 

to  the  following  version  of  the  problem  DS: 

DSI;  Given  finite  sets  f.  .Y-.U.  ,U.,  IC  Y.xY.  and  a 
U  xU 

function  Si  1  +  2  1  ,  are  there  functions  d^iY^-*^: 

i-1,2,  such  that 

<?i(yi),a2(y2))es(yi'y2)'  v(yi*y2)el?  (2-3> 

Note  that  any  instance  of  rsi  is  equivalent  to  an  ins¬ 
tance  of  DS  in  which  S(yj.y2>«  U1XU2'  V(yi»yj)£  !• 

That  is,  no  compatibility  restrictions  are  placed  on 
th~  decisions  of  the  tvo  processors,  for  those  (y^.y^) 
that  cannot  occur. 

We  now  proceed  to  the  main  result  of  this  Section! 
Theorem  3.2.2i 

a)  The  problem  DSI  restricted  to  instances  satisfying 
any  of  the  following: 

(i)  One  or  mjre  of  |  |  #  |  | «  to 

(ii)  |ui|-,02|-2. 

(iii)  D^Jj-2, 

(iv)  D  -|U2j-2,  (or  Djl-|ui|-2) 
may  be  solved  in  polynomial  time. 

b)  The  problem  DSI  is  NP-complete  even  if  we  restrict 
to  instances  for  which 


The  result  concerning  the  case  D^*l  or  D2«l  is  not 

surprising.  It  is  well-known  that  nested  information 
structures  may  be  exploited  to  solve  otherwise  dif¬ 
ficult  decentralized  problems.  But  except  for  the  case 
Dj-D2”2  (which  is  sort  of  a  boundary)  the  absence  of 

nestedness  makes  decentralized  problems  computationally 
hard.  Our  result  gives  a  precise  meaning  to  the  state¬ 
ment  that  non-nested  information  structures  are  much 
more  difficult  to  handle  than  nested  ones. 

Theorem  3.2.2  shows  that  even  if  D^jDj  are  held  cons¬ 
tant,  the  problem  DSI  is,  in  general,  NP-complete. 

There  is,  however,  a  special  case  of  DSI,  with  D^.Dj 

constant,  for  which  an  efficient  algorithm  of  the 
dynamic  programming  type  is  possible: 

Theorem  3.2.3:  Let  Yj-tl, . . . ,mj,  Y2»{l,...,n}  and  sup¬ 
pose  that  | i-j |<  D,  V(i,j)e  I.  Then,  if  D  is  held 
constant,  DSI  may  be  solved  in  polynomial  time. 

Remark:  In  fact,  the  conclusion  of  Theorem  3.2.3  re¬ 
mains  true  if  we  assume  m-n  and  we  replace  the  condition 
|  i— j  | <_  D  by  the  weaker  condition  |  i-j  |  (mod  n)<  D. 

The  proof  consists  of  a  small  modification  of  the 
preceding  one. 

The  condition  |i-j|£D,  vU<j)S  I  is  fairly  natural 
in  certain  applications.  For  example,  suppose  that  the 
observations  y^  and  y2  are  noisy  measurements  cf  an 

unknown  variable  x  (y^-x+w^)  where  the  noises  w^  are 

bounded:  |w^|<_  D/2. 

The  condition  |i-j| (mod  n)£  D  may  also  arise  if  the 
observations  y^,  ,y2  are  noisy  measurements  of  some 

unknown  angle:  y^“9  +  w^ 


In  this  Section  we  define  and  discuss  briefly  a  few 
more  combinatorial  problems  relevant  to  decentralized 
decision  making.  All  of  them  will  be  seen  to  be  harder 
than  problem  DS  of  the  last  section  (i.e.  they  contain 
DS  as  a  special  case)  and  are,  therefore  NP-l.ard  (that 
is,  NP-complete,  or  worse). 

The  best  known  static  decentralized  problem  is  the 
team  decision  problem  [Marschak  and  Badner,  1972] 
which  admits  on  elegant  solution  under  linear  quadratic 
assumptions.  Its  discrete  version  is  the  following: 

TDP  (Team  Decision  Problem) :  Given  finite  sets  Y^ ,  \’^ 
0l.O  .  a  probability  mass  function  p:  Y^xY^+Q,  and  a 
coat  function  c:  Y^xYjXU^xUj  -*N,  find  decision  rules 
J  :  Y^  -►U^,  i»l,2  which  minimize  the  expected  cost 

Jt&j.'dj)-  £  £  C<yl'y2’^l(yl>  '^2fy2))P,yi'y2> 

heYl  Y2eY2 

Let  S(y1,y2)-{(u1,u2)eu1xU2:  c (; 1>Y2'ui'’12)“0* .  If 

we  solve  TDP,  we  have  effectively  answered  the  questiai 
whether  there  exist  d-,JJ2  such  that  J  This 

is  equivalent  to  the  question  whether  there  exist  sat¬ 
isficing  decision  rules  (with  the  satisficing  sets 
S(yl(y,)  defined  as  above).  Therefore,  TDP  is  harder 

than  DS: 

Proposition  3.1:  The  discrete  team  decision  problem 

is  NP-hard,  even  if  the  range  of  the  cost  function 
c  is  {0,1}. 

Instead  of  trying  to  “satisfice"  for  every  pair  of 
observations  Yl'Xlf2'  may  ^  I”ore  appr'' 

priate  to  impose  a  probability  mass  function  "jXYj 

and  try  to  maximize  the  probability  of  satisficing. 

This  leads  to  the  next  problem: 

MPS  (Maximize  Probability  of  Satisficing) :  Given 
finite  sets  Yj/y2'Ul,U2'  a  probability  mass  function 

V°2 

p:  YjXYj-rfi  and  a  function  S:  Y^xYj  *-2  ,  find 

decision  rules  Y^HI^,  i»l,2,  which  maximize  the 
probability  of  satisficing  J(d^>d2)  ■ 
pr  ( Oj,  (yx)  ,d2  (y2> ) fed  (y1«y2> J  • 

We  now  take  a  slightly  different  point  of  view. 
Suppose  that  communications  are  allowed,  so  that  the 
processors  may  always  make  satisficing  decisions  by 
communicating  (assuming  that  S(y^,y2)j<$, 

V(yi,y2>e  yixy?) •  Suppose,  however,  that  communica¬ 
tions  are  very  expensive,  so  that  we  are  interested 
in  a  scheme  which  guarantees  satisficing  with  a  mini¬ 
mum  amount  of  communications.  We  will  assume  that  if 
one  of  the  processors  initiates  a  communication,  all 
their  information  will  be  exchange  at  unity  cost. 

(For  a  more  refined  way  of  counting  the  amount  of  com¬ 
munications,  see  Section  3.5.) 

MPC  (Minimize  Probability  of  COBWiunicatlons) :  Giver, 
finite  sets  yj/Y2'uj/°2  a  Profaability  “a®8  function 

V°2 

pi  YjtY^Q  and  a  functional  Y^xY2  -»  2  ,  find 

decision  rules  Y^U^Utc},  i»l,2,  which  minimiza 
the  probability  Prtd^fy^l-C  or  32(y2!"C)  of  conraunica- 
ting  subject  to  the  constraint 


If  and  ^(y^Cl  then  (^  (y^  >32  (y.,)  )<3S  (yify2). 

The  proof  o£  the  following  ic  trivial: 

Proposition  3.2:  The  problems  MPS  and  .1PC  are  NT-hard. 
Ir.  fact,  we  also  have: 

Proposition  3.3:  The  problems  TOP  (with  a  zero-one 
cost  function)  and  MPS  are  NP-hard,  even  if  | | - 1 |  =2  . 

We  could  also  define  dynamic  versions  of  DS  or  of 
the  team  problem,  in  a  straightforward  way  [Tenney, 
19831.  Since  dynamic  problems  cannot  be  easier  than 
static  ones,  they  are  automatically  NP-hard. 

4.  Decentralized  Hypothesis  Testing 

A  basic  problem  in  decentralized  signal  processing, 
which  has  attracted  a  fair  amount  of  attention  recently, 
is  the  problem  of  decentralized  hypothesis  testing 
[Tenney  and  Sandell,  1981;  Ekchian,  1982;  Ekchian  and 
Tenney,  1982;  Kushner  and  Pacut,  1982;  Lauer  and 
Sandell,  1983].  A  simple  version  of  the  problem,  in¬ 
volving  only  two  processors  and  two  hypotheses  may  be 
described  as  follows: 

Two  processors  S^  and  S2  receive  observations  y^SY^, 
y^SY^,  respectively,  where  is  the  set  of  all  pos¬ 
sible  observations  of  processor  i.  (Figure  1).  There 
are  two  hypotheses  Hq  and  on  the  state  of  the 

environment,  with  prior  probabilities  p^  and  p1  res¬ 
pectively.  For  each  hypothesis  H^,  we  are  .  lsc  given 
the  joint  probability  distribution  P(Yj#y_|H^)  of  the 

observations,  conditioned  on  the  event  that  it  is  true. 
Upon  receipt  of  >  ^ ,  processor  evaluates  a  message 

ui«{0,i}  according  to  the  rule  u^*^  (y^  ,  where 

Then,  u^  and  u2  are  transmitted  to  a 


Figure  1;  A  Scheme  for  Decentralized  Hypothesis 
Testing. 


central  processor  (fusion  center)  which  evaluates 

u  =u,n  u  and  declares  hypothesis  H  to  be  true  if 
o  1  2  o 

uq«0,  if  u  “1.  (So,  we  essentially  have  a  voting 
scheme) .  The  problem  is  to  select  the  functions  J  , 

d2  so  as  to  minimize  the  probability  of  accepting  the 
wrong  hypothesis.  (More  general  performance  criteria 
may  be  also  considered) . 

Most  available  results  assume  that 

P(y1.y2|Hi)-P(y1|Hi)P(y2|Hi),  i-1,2,  (4.1) 


which  states  that  the  observations  of  the  two  proces¬ 
sors  are  independent,  when  conditioned  on  either  hy¬ 
pothesis.*  In  particular,  it  has  been  shown  [Tenney 
and  Sandell,  1981]  that  the  optimal  decision  rules 


are  given  in  terms  of  thresholds  for  the  likelihood 
p  P(H  | y .  ) 

ratios  - °  i — : —  .  The  optimal  thresholds  for 

P1 


the  two  sensors  are  coupled  through  a  system  of  equa¬ 
tions  which  gives  necessary  conditions  of  optimality. 
(These  equations  are  precisely  the  person-by-person 
optimality  conditions) .  Few  analytical  results  are 
available  when  the  conditional  independence  assumption 
is  removed  [Lauer  and  Sandell,  1983).  The  approach  of 
this  section  is  aimed  at  explaining  this  status  of  af¬ 
fairs,  by  focusing  on  discrete  (and  finite)  versions 
of  the  problem. 

We  first  have: 


Theorem  4.1:  If  Y^ , V2  are  finite  sets  and  (4.1)  holds, 

then  optimal  choices  for  J  may  be  found  in  poly¬ 
nomial  time. 

So,  under  the  conditional  independence  assumption, 
decentralized  hypothesis  testing  is  a  computationally 
easy  problem.  Unfortunately,  this  is  not  the  case  vben 
the  independence  assumption  is  relaxed.  Our  main  re¬ 
sult  (Theorem  4.2)  states  that  (with  Y^,  Y2  finite 

sets) ,  decentralized  hypothesis  testing  i3  a  hard  com¬ 
binatorial  problem  (NP-hard) .  This  is  true  even  if  we 
restrict  to  the  special  case  where  perfect  detection 
(zero  probability  of  error)  is  possible  for  the  corres¬ 
ponding  centralized  hypothesis  testing  problem. 

Although  thir  Is  in  some  sense  a  negative  result,  it  is 
useful  because  it  indicates  the  direction  in  which 
future  research  on  this  subject  should  proceed: 

Instead  of  trying  to  find  efficient  exact  algorithms, 
research  should  focus  on  approximate  algorithms,  or 
exact  algorithms  for  problems  with  more  structure  than 
that  assumed  heze.  Moreover,  our  result  implies  that 
any  necessary  conditions  for  optimality  to  be  developed 
are  likely  to  be  deficient  in  one  of  two  respects; 

a)  Either  there  will  be  a  very  large  number  of  deci¬ 
sion  rules  satisfying  these  conditions. 

b)  Or,  it  will  be  hard  to  find  decision  rules  satis¬ 
fying  these  conditions. 

In  particular,  optimal  decision  rules  are  not  given  in 
terms  of  threshold:-  an  likelihood  ratios. 

Of  course,  there  remains  the  question  whether  ef¬ 
ficient  approximate  algorithms  exist  for  the  general 
decentralized  hypothesis  testing  problem,  or  whether 
we  must  again  r-.strict  to  special  cases  of  the  problem. 
We  now  present  formally  the  problem  to  be  analyzed. 


DKT:  (Decentralized  Hypothesis  Testing,  Restricted  to 
Instances  for  which  Ferfcct  Centralized  Detection  is 
Possible) . 

We  are  given  finite  sets  Y^.Y2;  a  rational  number 

number  k;  a  rational  probability  mass  function 
p:  YjXY2  -SQO  [0,1] ;  e  partition 


*  Such  an  assumption  is  reasonable  in  problems  of  detec¬ 
tion  of  a  known  signal  in  independent  noise,  hut  i3  typ- 
icall^, violated  in  problems  of  detection  of  an  unknown 
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of  YjXYj.*  Do  there  exist  ♦{0,1}, 

32>V2-*-{0,i}  such  that  J(d2«d2)£k,  where 


O,. S.)  -  53  Pty.-yjJd.iyJd-ty,)  + 

*  ly^yj^o  121122 

(y1?y2)6A1  ^V^'W^W17  «-2> 


emarks;  1.  If  we  let  k»0,  then  DHT  is  a  special  care 
of  problem  r>S  (Section  2),  with  | |«|u2 }-2,  and  is 

polynoaially  solvable,  according  to  Theorem  3.2.1.  In 
general  DHT  is  a  special  case  of  MTS  and  TO?  (Section 
3.3)  with  ] j  —  | [ “2 .  Consequently,  Theorem  4.2 

below  proves  Proposition  3.3. 

2)  Clearly,  the  optimization  problem  (Minimize  J ( ^ )  • 

with  respect  to  cannot  be  easier  than  DHT. 

Since  DHT  will  be  shown  to  be  NP-complete,  it  follows 
that  the  above  optimization  problem  is  HP-hard. 

3)  In  DHT,  as  defined  above,  we  are  only  considering 
instances  for  which  perfect  centralized  detection  is 

possible!  Think  of  H  as  biing  the  hypothesis  that 
o 

(y,  ,yje  A  ,  and  H.  as  being  the  hypothesis  that 

1  4  O  1 

(y^.y  )•  Aj.  Certainly,  if  a  processor  knows  both  y^. 
y2,  the  true  hypothesis  may  be  founa  with  certainty. 

For  the  decentralized  problem.,  the  cost  function 
'Kdj,d2)  is  easily  seen  to  be  the  probability  of  error. 

4)  The  result  to  be  obtained  below  remains  valid  if  the 
fusion  center  uses  different  rules  for  combining  the 
messages  it  receives  (e.g.  Uq” ( it  V (  u^) ) ) ,  or  if  we 

leave  the  combining  rule  unspecified  and  try  to  find 
an  optimal  combining  rule. 

Theorem  4.2i  DHT  is  NP-complete. 


Communications  Protocols 


Suppose  that  we  are  given  an  instance  of  the  dis¬ 
tributed  satisficing  problem  (DS)  and  that  it  was 
concluded  that  unless  the  processors  communicate, 
satisficing  cannot  be  guaranteed  for  all  possible  ob¬ 
servations.  Assuming  that  communications  are  allowed 
(but  are  costly) ,  wo  have  to  consider  the  problem  of 
designing  a  communications  protocol:  what  should  each 
processor  communicate  to  the  other,  and  at  what  order? 
Moreover,  since  communications  are  costly,  we  are 
interested  in  a  protocol  which  minimizes  the  total 
number  of  binary  messages  (bits)  that  have  to  be  com¬ 
municated.  (The  word  "bits"  above  does  not  have  the 
Information  theoretic  meaning . ) 

Before  proceeding,  we  must  rake  more  precise  the 
notion  of  a  communication  protocol  and  of  the  number 
of  bits  than  guarantee  satisficing. 

Given  an  instance  d*(y, ,Y„,U, )U,,I,S)  of  the  problan 

1  2  i  i  -  2 

DSI  we  will  say  that:  1 

There  is  a  protccoi  which  guarantees  satisficing 
with  0  bits  of  courounications,  if  0  is  a  YES  instance 
of  the  problem  DSI.  (That  is,  iflthere  exist  satis¬ 
ficing  decision  rules,  involving  no  communications.) 

We  then  proceed  inductively:  1 
There  is  a  protoool which  guarantees  satisficing 
with  K  bits  of  communications  (KS  N) ,  if  for  some 
ie{l,2}  (say,  i-1)  there  is  a  function  m:Y 
such  that  for  each  of  the  instances  1 

M»,n«  (0),V,,B.,0,,I  Ot(Y,  On:  (0))xY,],S)  and 


•  That  is  A0>- Ai“YixY2  and  A0(~'  Ai"$- 


®"-(yinm'l(l),Y2,U1,U2,I  ntYinm'1(W)xY2].S)  there  i* 

a  protocol  which  guarantees  satisficing  with  not  more 
than  K-l  bits  of  communications .  (Here  m_1(i)« 
{yievi:m(y1)-i}.) 

The  envisaged  sequence  of  events  behind  this  defini¬ 
tion  is  the  following:  Each  processor  observes  his 
measurement  Y^^Y^i  i*l«2.  Then,  one  of  the  processors, 

say  processor  1,  transmits  a  message  mty^),  with  a 
single  bit  to  the  other  processor.  From  that  point  on, 
it  has  become  common  knowledge  that  y^ey^nm-1(y^)  i 

therefore,  the  remaining  elements  of  Y^  may  be  ignored. 
We  can  now  state  formally  the  problem  of  interest: 

MBS  (Minimum  bits  to  satisfice) :  Given  an  instance  V 

of  DSI  and  Ke  N,  is  there  a  protocol  which  gurentees 
satisficing  with  not  more  than  K  bits  of  communicaticns? 

By  definition,  MBS  with  K»C  is  identical  to  the 
problem  DSI.  Moreover,  MBS  with  K  arbitrary  cannot  be 
easier  than  MBS  with  K«Q  (which  is  a  special  case) . 
Therefore,  MBS  is,  in  general  NP-hard.  Differently 
said,  problems  involving  communications  are  at  least 
as  hard  as  problems  involving  no  communications. 

We  have  seen  in  Section  2  that  when  |u,|»|u2|«2, 

DSI  may  be  solved  In  polynomial  time.  Therefore,  MBS 
with  K*0,  |u^|»2,  j U2 1 *2  is  polynomially  solvable. 

However,  for  arbitrary  K,  this  is  no  longer  true: 

Theorem  5.1:  MBS  is  NF -complete,  even  if  | U^| ■ | Uj | ■ 

{0,1}  and  even  if  we  restrict  to  instances  for  which, 
for  any  (y^y^ei,  either  S (yl(y2)- J(0,0)}  or 

s(yl,y2)“{(l,X)}. 

The  t  »ve  theorem  proves  a  conjecture  of  A.  Yao 
[Yao,  1979] .  The  proof  was  mainly  constructed  by 
C.  Papadimitriou  and  may  be  found  in  (Papadimitriou 
and  Tsitsiklis,  1982) . 

We  should  point  out  that  the  special  ca3e  referred 
to  in  Theorem  S.l  concerns  thu  problem  of  distributed 
function  evaluation:  we  are  given  a  Boolean  function 
f :YixY2  ♦  {°,l}  and  we  require  that  both  agents  (proces¬ 
sors  eventually  determine  the  value  of  the  function 
(given  the  observation  -input  (y^y^),  by  exchanging 

a  minimum  number  of  bits.  In  our  formalism) 

s(yi,y2)"  i£  f(Yi,y2'”0  and  s<y1.yaMu.i)> 

if  f(y1.y2)-i. 

In  Section  2  we  had  investigated  the  complexity  of 
DSI  by  restricting  to  instances  for  which  the  set  I 
had  constant  degree  (D^Dj).  This  may  be  done,  in 

principle,  for  MBS,  as  well,  but  no  results  are  avail¬ 
able,  except  for  the  simple  case  in  which  D^Dj-2. 

In  fact,  when  Dj”D2“2  each  processor  may  transmit 
his  information  to  the  othei  agent  by  communicating  a 
single  binary  message  and,  for  this  reason,  we  have: 

Proposition  5,1:  MBS  restricted  to  instances  for  which 
D^-D^  may  be  solved  in  polynomial  time.  Moreover, 
an  optimal  protocol  requires  transmission  of  at  most 
two  binary  messages,  one  from  each  processor. 

When  is  larger  than  (2,2),  there  is  not 

much  we  can  say  about  optimal  protocols.  However,  it 
is  easy  to  verify  that  there  exist  fairly  simple  non- 
optimal  protocols  (which  nay  be  calculated  in  poly¬ 
nomial  time)  which  involve  relatively  small  amount.-  of 
comr unication .  This  is  because: 

Proposition  5,2:  Suppose  that  I  has  degree  (D^,D2)  and 
that  S(Yl,y  W,  V(y2»l’2)ei.  Then  information  may  be 

centralized  (and  therefore  satisficing  is  guaranteed) 
by  means  of  a  protocol  requiring  communication  of  at 
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no.t  Jiog^Djp^  binary  messages  by  each  processor. 

Moreover,  such  a  protocol  may  be  constructed  in  time 
0<<|yiHy2|H|viM*2I>>-  (Here  |xj  ,  xe  R,  stands 

for  the  smallest  integer  larg  :r  than  x.) 

Remark  li  It  might  be  tempting  to  guess  that  proces¬ 
sor  I  (respectively  2)  needs  to  communicate  only 
|log2D2J  (respectively  jlog^D J)  bits,  but  this  is  not 

true,  as  can  be  seen  from  fairly  simple  example". 

6.  Conclusions 

We  summarize  here  the  main  conclusions  of  this  pa¬ 
per. 

Even  if  a  set  of  processors  have  complete  knowledge 
of  the  structure  of  a  distributed  decision  making  pro¬ 
blem  and  the  desired  goali  even  if  the  corresponding 
centralized  problem  is  trivial;  even  if  all  relevant 
sets  are  finite,  a  satisficing  decision  rule  that  in¬ 
volves  no  on-line  communications  may  be  very  hard  to 
find,  the  corresponding  problem  being,  in  ceneral, 
NP-coraplete.  There  are  many  objections  to  the  idea 
tha  NP-completeness  is  an  unequivocal  measure  of  the 
difficulty  of  a  problem,  because  it  is  based  on  a 
worst  case  analysis,  whereas  the  average  performance 
of  an  algorithm  might  be  a  more  adequate  measure; 
moreover  NP-hard  optimization  problems  may  have  very 
simple  approximate  algorithms.  However,  NP-complete 
problems  are  often  characterized  by  che  property  that 
any  known  algorithm  is  very  close  to  systematic  ex¬ 
haustive  search;  they  do  not  possess  any  structure  to 
be  exploited. 

Concerning  the  problem  DS,  and  its  variations,  we 
may  reach  the  following  specific  conclusions:  No  sim¬ 
ple  algorithm  could  solve  DS.  Given  that  communica¬ 
tions  would  be  certainly  required  for  those  instances 
of  DS  that  possess  no  satisficing  decision  rules,  it 
would  not  be  a  great  loss  if  we  allowed  the  proces¬ 
sors  to  conmiunicate  even  for  some  instances  of  DS  for 
which  this  would  not  be  necessary.  Even  if  these 
extra  conmunications  -being  redundant-  do  not  lead  to 
better  decisions,  they  may  greatly  facilitate  the  de¬ 
cision  process  and  -from  a  practical  point  of  view  - 
remove  some  load  from  the  computing  machines  employed. 

Concerning  the  problem  of  distributed  hypothesis 
testing,  we  have  shown  that  it  becomes  hard,  once  a 
simplifying  assumption  of  conditional  independence  is 
removed.  This  explains  why  no  substantial  progress 
on  this  problem  had  followed  the  work  of  Tenney  and 
Sandell  [1982] . 

From  a  more  general  perspective,  we  are  in  a  posi¬ 
tion  to  say  that  the  basic  vand  the  simplest)  problems 
of  decentralized  decision  making  are  hard,  in  a  precise 
mathematical  sense.  Moreover,  their  difficulty  does 
not  only  arise  when  one  is  interested  in  optimality. 
Difficulties  persist  even  if  optimality  is  replaced 
by  satisficing.  As  a  consequence,  further  research 
should  focus  on  special  cases  and  easily  solvable 
problem  as  well  as  on  approximate  versions  of  the 
original  problems. 

In  cases  where  communications  are  necessary  (hut 
costly)  there  arises  naturally  the  problem  of  desig¬ 
ning  a  protocol  of  conmunications.  Unfortunately,  if 
this  problem  is  approached  with  the  intention  to  mi¬ 
nimize  the  amount  of  communications  that  will  guar¬ 
antee  the  accomplishment  of  a  given  goal,  we  are  again 
led  to  intractable  combinatorial  problems.  Therefore, 
practical  communications  protocols  earn  only  be  desig¬ 
ned  on  a  "good"  heuristic  or  ad-hoc  basis,  and  they 
should  not  be  expected  to  be  optimal;  approximate 
optimality  is  probably  a  more  meaningful  goal.  Again, 
allowing  some  redundancy  in  on-line  communications 
nay  lead  to  substantial  savings  in  off-line  computa¬ 
tions  . 
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Summary 

A  control  problem  with  a  system  modeled  as  a  nonde- 
termir.istic  finite  state  machine  is  considered.  Several 
agents  seek  to  optimize  the  behavior  of  the  system 
under  a  minimax  criterion,  with  each  agent  having  dif¬ 
ferent  information  about  the  system  state.  The  non- 
deterministic  model  of  uncertainty,  combined  with  the 
minimax  criterion,  lead  to  equivalence  relations  on  the 
past  input /cutput  histories  of  each  agent  which  genera¬ 
te  simple  sufficient  statistics  for  the  optimal  control 
laws.  This  sheds  light  on  the  basic  nature  of  decentra¬ 
lized  control  and  permits  complete  solution  of  a  parti¬ 
cular  class  of  problems - 

I .  Introduction 

A  system  to  be  controlled  is  assumed  to  be  described 
as  a  finite  state,  nondeterministic  automaton  with  in¬ 
puts  supplied  oy  several  control  agents.  Each  agent 
receives  an  observation  at  each  discrete  time  step 
which  indicates  a  set  in  which  the  current  state  must 
lie.  Each  observation,  coupled  with  knowledge  of  the 
s"ystem  structure,  can  lead  to  inferences  about  the 
past  system  behavior,  hence  about  the  past  observations 
of  other  agents  and  thus  predictions  of  other  agents' 
decisions.  This  simultaneous  interweaving  of  inference 
by  the  agents,  as  each  deduces  the  potential  actions  of 
others,  and  the  deductions  of  others  about  itself,  etc. 
leads  to  3ome  of  the  complexity  of  the  analytical 
process  for  general  decentralized  problems. 

Based  on  this  model  structure,  an  approach  to 
addressing  the  problem  of  designing  the  optimal  (in  the 
appropriate  worst-case  sense)  decision  rules  has  been 
developed.  The  approach  is  based  on  the  identification 
of  the  set  of  sufficient  statistics  for  each  agent  to 
use  and  the  dynamic  relations  between  them;  these 
sufficient  statistics  are  no  more  than  the  intertwined 
deductions  of  the  agents  about  each  other  truncated  at 
the  point  where  they  are  no  longer  productive .  The  set 
of  these  statistics  form  an  extended  state  space  over 
which  dynamic  programming  may  be  used  to  derive  the 
optimal  decision  rules. 

II.  Notation  and  Problem  Formulation 

A.  Notation 

This  work  will  use  set  valued  functions 

f:X  +  2V  (2.1) 

to  model  nondeterministic  behavior,  i.e.  y  8  j  •  f(x) 
indicates  that  ar.y  eirnent  y  of  the  set  yCY  may  arise 
as  a  result  of  applying  £  tt  the  point  x.  "The  exten¬ 
sion  of  £  to  a  function  on  the  power  set  of  X 

£e  :  2*  -  2*  (2.2) 
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where 

f  (x)  -  (JHx)  (2.3) 

X€X 

will  not  be  distinguished  from  f_  itself.  The  preimage 
of  y  under  will  be  denoted  by 

X-  f‘X(y)  >  (x|yef(x)}  (2.4) 

£  X  will  not  be  distinguished  from  its  extension  to 

f'^Cy)  =  {xi’^Af 'X(x)  /  0}  (2.5) 

Finally,  an  cost  functional 

J  s  X  -*  K  (2.6) 

will  have  an  extension 

J  (x)  ■=  max  J(x)  (2.7) 

~  xex 

which  will  also  not  be  distinguished  from  its  restrict¬ 
ion,  J. 

Subspripts  will  indicate  the  agent  associated  with 
each  variable.  Superscripts  will  indicate  elements  of 
Cartesian  product  set;  e.g.  xfc  »  (x(l) , . . . ,x(t) )e  Xt. 

B.  Dynamics 

The  system  to  be  controlled  will  have  a  finite  state 
space  X  with  N  elements  and  dynamics 

x(t+l)  6  f(x(t>,  ux(t),  u2(t))  (2.8) 

The  state  and  its  dynamics  may  be  taken  to  include  any 
jnteragent  communication  mechanisms.  The  initial  star? 
is  assumed  fixed,  known  to  all  agents  and  is  denoted 

x  . 
o 

Observations 

yx(t)  e  hi  (x(t))  (2.9) 

are  available  to  each  agent  at  each  time  just  before 
u^tt)  is  to  be  selected. 

Each  agent  is  assumed  to  have  perfect  recall  of  oil 
past  observations  and  decisions.  The  decision  rule  by 
which  u^ (t)  is  selected  is  restricted  to  being  a  causii 
function  cf  the  local  information  sequence 

Y  (•  ft)  :  -►  u.  (2.10) 

1  i  i 

T 

I\  represents  the  entire  sequence  of  decision  rules 
for  agent  i. 

C.  Objective 

A  cost  function  is  defined  as 

J:  X  XUX  xUj  ♦  JR  (2.11) 

Taken  together,  the  dynamics  (2.8-2.10)  and  control 
T  T 

laws  r  ,  r  recursively  define  a  set  of  possible  joint 
state,  control,  and  information  trajectories. 
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;fine 


J(rT,r?)  »  max  max  J(x(t),u.  (t),ujt)) 

4  *  teio . t}  x  ( t )  1  2 

u2(t) 

nere  the  x(t) ,  u^ltl/U^ft)  range  over  all  values 
-intly  possible  at  time  t  as  determined  by  this 
acursion.  The  overall  objective  is  to  minimize  this 
unction. 


I I .  The  Information  Relation 

.  Definition 

Begin  oy  considering  the  autonomous  case. 
efi-ition:  The  global  condi tronal  state  set  at  time  t 

denoted  X'tl^lX,  is  the  set  of  all  Possible  states 
which  the  system  may  occupy  at  time  t  and  whi-li  ray 
be  reached  along  a  trajectory  xf  which  is  poss'Lle 
given  both  agents'  observation  sequences. 

This  global  conditional  state  set  is  analogous  to 
he  Markov  conditional  state  distribution;  it  may  be 
omputed  recursively. 

.emma  1:  The  global  conditional  state  set  may  be 
computed  from 

x(0)  =>  (x  } 

®  •  l  —  l 

xit+l)  =  f ix(t))^>h  A(y  (t+l)>r\h  A(y,(t+1)) 

(3.1) 

’roof :  Set  manipulations. 


;V.  Information  Relations 
4.  Concept 

Jefrnitior:  An  information  relation  R  for  a  two  agent 
pioblem  is  a  function  from  two  sets  Z  (t)  and  Z2(t) 
to  the  power  set  of  another  set  X.  For  autonomous 
systems,  R  is  a  primitive  information  relation  if 

Z,  (t)  »  Y?,  Z.(t)  *  ySx  is  the  state  space,  and 
1  l  «.  2 

R(yJ>  V2>  *  •  (4.1) 

The  motivation  for  studying  the  information  re¬ 
lation  is  to  reduce  a  primitive  relation  to  one  of 
smaller  dimension  which  still  serves  for  the  generation 
of  optimal  decision  rules.  The  generic  sets  Z.  which 
comprise  any  relation  will  be  aggregations  of  the 
primitive  information  sets  .  The  remainder  of  this 
section  establishes  the  algebraic  structure  of 
information  relations. 


B.  Humomorphisms 

The  global  conditional  state  sets  have  a  lattice 
structure  superimposed  on  them  by  the  set  containment 
relation.  Information  relations  have  a  similar 
structure  which,  while  not  a  lattice,  is  (almost)  a 
partial  order. 


Definition:  A  homomorphism  4  from  R,  defined  from  Z^ 
and  Z2  to  2X,  to  R'  ,  defined  from  Z^,  z'  to  2X, 
is  a  "‘pair  of  functions  (42<42>  4 


4  :  Z 
V1  1 


(4.2) 


4  :Z  -»  Z' 

V  2  2 


satisfying 

R<Zl'Z2,ClR'  (41(*1>  'f2(z2)  1 
RlijhjlCR'  (4,  <*.,)  42U2>) 


V  =2  6  z2 


V  Z1  6  Z1 


(4.3) 


Definition:  An  information  relation  R  is  contained  in 
another  R*  (R^Hf)  if  there  is  a  homomorphism  from 
R  co  R1  . 

The  containment  relation  is  clearly  reflexive  (if  $ 
is  the  identity  map)  and  transitive  (through  composi¬ 
tion  of  $.  ’a)  .  it  thus  imposes  a  structure  on  the  set 
of  information  relations  which  is  partial  order  on 
equivalence  classes,  where: 

Definition:  •  R  is  equivalent  to  R*  (  R  ^R’) 
if  R<C  R’  and  R^R'. 

Brth  the  partial  order  and  equivalence  relation  have 
useful  interpretations  in  the  decentralized  problem. 

c.  Automorphisms 

This  section  focusses  on  the  equivalence S on  in¬ 
formation  relations. 

Definition:  An  automorphism  $  on  an  information  rela¬ 
tion  P  is  a  homomorphism  from  R  to  R.  If  bowh  ij 
and  are  1:1,  then  it  is  an  isomorphism. 

Otherwise,  it  is  a  reducing  automorphism. 

The  automorphisms  which  are  not  isomorphisms  ar  :  ot 
considerable  interest. 

Definition:  An  information  relation  R  is  irreducible 
if  all  automorphisms  on  R  are  isomorphisms.  Other¬ 
wise  it  is  reducible. 

Definition:  The  reduction  of  an  information  relation 

x 

R:  Z^  x  7>2  •*  Z  by  a  reducing  automorphism 

4  =  (42,42)  is  an  information  relation 

R*  :  <k(Z  )  X  42<Z,)  -*  2*  where 

R'  (4,  (2.)  ,40(zJ  )  »  R(4,  (z.),4,(z,))  (4.4) 

1122  lj.22 

D.  The  Core 

Every  i.. formation  relation  can  be  reduced  to  an 
irreducible  one  by  suitable  compositions  of  reducing 
automorphisms  (thus  oy  some  single  reducing  auto¬ 
morphism)  . 

This  notion  is  the  most  essential  part  of  this  work. 

Definition:  A  core  of  an  information  relation  R, 
denoted  core  (R)  or  R* ,  is  an  irreducible  infor¬ 
mation  relation  obtained  from  R  by  some  auto¬ 
morphism. 

The  core  has  a  number  of  interesting  properties. 

The  most  basic  rely  on  the  following  lemma. 

Le;.n:a  2 :  If  two  irreducible  information  relations  R  ^ 
end  R'  are  equivalent  under  Jj,  then  they  are  equal. 

Proof :  If  RS  R’,  then  there  is  a  homomorphism  4  frow 
R  to  R',  and  anotner,  O'from  R‘  to  R. 

Consider  the  composed  homomorphism  <T  from  R  to  R: 

VV  =  VW1  (4-5) 

42  U2)  =  42(42  (z2> ! 

Thit  $  is  indeed  a  homomorphism  from  R  to  R  is  shown 

by 


R<Z1,Z2!<S-  R'  (VZ1,,'’>2(22))  V  Z2  1  Z2  U'6) 

since  4  is  a  homomorphism  from  R  to  R',  and  in  turn 
R'  (tj/tj)  ,42ia2))CR(4{(41(z1))  .42<4*<?2)>) 

v  -2  e  z2  ;  1.7) 


1 


Up  to  isomorphism.  This  qualifier  will  be  left 
implicit  in  the  sequel. 


i 

l 


by  $'  being  a  homomorphism.  Thus  $  is  an  automor¬ 
phism  on  R,  and  in  fact  must  be  an  isomorphism 
since  R  is  irreducible.  However,  the  composition 
of  $  and  can  be  an  isomorphism  if  and  only  if 
$  and  $'  are  isomorphisms;  hence  R  and  R'  are  equal. 

This  lemma  immediately  gives: 

Theorem  1;  The  core  of  an  information  relation  is 
unique.  Moreover,  if  R«J  •  then 
core  (R)  *  core  (R‘ ) . 

Proof:  Any  core  (R) C  R,  since  a  homomorphism  from 

R  to  a  core  (R)  exists  by  definition,  and  a 
homomorphism  from  core  (R)  to  R  exists  by  con¬ 
struction:  core  (R)  is  a  (perhaps  relabeled) 
restriction  of  R  to  subsets  of  2  ;  construct  the 
homomorphism  from  elements  of  these  subsets  back  into 
their  original  values  in  Z^  an(j  z2.  If  two  cores, 
core  (R)  and  core  ' (R)  exist,  then 

core  (R)5  R  S  °ore  *  (R>  1 1.8) 

Both  jure  irreducible;  by  transitivity  of  ^  and 
lemma  2  they  must  be  equal,  hence  unique. 

Moreover 

core  care  (R*)  (4.9) 

similarly  implies  core  (R)  «  (R*). 

At  any  point  in  time  Z. (t)  in  R(t;  will  represert  a 
reduced,  perhaps  trivially,  version  of  Y*.  There  i_  a 
natural  way  to  extend  this  aggregation  to  its  counter¬ 
part  at  time  t+1. 

Definition:  The  expansion  of  an  information  relation 
R(t)  to  another  relation  R(t+1)  is  denoted 


R(t+1)  -  F (R(t) ) 
and  is  constructed  by  setting 


Z^t+1)  -  Zi(t)  x 


and 


R(t+1) [BL(t+i),s2(t+l)) 

*  R(t+1)  Uj^t)  ,y^)  ,*2(t)  ,y2'  ) 


(4.10) 


(4.11) 


(4.12) 


-  f(R(t)  U1(t),r2(t)])^h*11(y1)/°\h'21'(y2) 

The  structure  of  this  expansion  is  captured  in 

Lemma  3:  If  R(t)  is  the  primitive  relation  at  time  t, 

then  F(R(t))  is  the  primitive  relation  for  time  - 

t+1. 

Proof:  By  definition,  R(t) :  Y^  x  Y2  -  2X  ,  and 

R(t) (y^,y2)  »  x(t) ,  the  global  conditional  state 
4  *  t  t 

set  based  on  (y^Yj).  Identifying  y^  in  (4.13)  with 
the  observation  y. (t+1)  yields 

R(t+l)(y*+1,y*+1)  = 

-  f(x(t))/^h*^(y1(t+l))^h‘21(y(t+l))  (4.13) 

-  x(t+l) 

by  (4.1).  Thus  R(t+1)  is  a  primitive  information 
relation. 

Lemma  4:  If  R£2R‘,  then 
F(R)^  F(R') 


(4.14) 


Proof:  Since  R^R',  there  exists  functions  ^:Z^+  Z^ 
satisfying  (4,4).  Construct  functions 


,  x  •/.+■  z;  x  Y,  where 
i  -  i  i 


VW 


(4.15) 


Then  for  all  z2  and  y2 


(4.16) 

(4.17) 

(4.18) 


F(R((z1»y1) ,Z2(y2))) 

*  {(RtZj^.ZjD/^h  (Yj) 

-  f  (r-  (^.z^  ,$2  (z2) )  )/~\ h’1  (y/^«»’2l(y2> 

by  (4.4)  and  set  inequalities,  and  this 

-  F (R* )  (^ (2j_,Y^)  #<^2  <Z2 ,Y2> 

This  establishes  the  first  half  of  (4.4)  for  '$+; 
the  other  half  is  shown  by  a  symmetric  argument. 
Thus  $+  is  a  homomorphism  from  F(R)  to  F(R'); 
hence  F(R)C  F(R’) . 

This  sets  up  the  second  major  result: 

Theorem  2:  Let  R(t)  and  R(t+1>  be  primitive  inform¬ 
ation  relations  at  successive  times.  Then 


core  (R(t+1) )  »  core  (F(core  R(t)>) 
Proof:  By  Lemma  3. 

F (r (t) ) 


(4.19) 


(4.20) 


R(t+1) 

From  Theorem  1 

core  (R(t))^R(t)  (4.21) 

Hence  applying  leiraia  4  to  each  containment  in  (4.21) 

F(core(R(t)))jS  F(R(t))  (4.22) 

Then  by  Theorem  1  and  (3.2n) 

core  (F(core  R(t))))  ■-  core(R(t+l))  (4.23) 

Definition:  The  steady  state  core  information  relation 
R*,  if  it  exists,  is  the  core  of  some  primitive 
information  relation  R(t)  and  satisfies 


r*  rn  core  (F (R* ) ) 


(4.24) 


If  a  steady  state  core  can  be  found  for  a  system, 
then  a  great  deal  of  the  work  required  to  solve  the 
system  is  complete.  However,  not  all  systems  have  a 
steady  state  core. 

The  core  dynamics  alter;. tely  expand  Z^(t)  by 

appending  Y.,  a  new  observation,  and  then  reduce  it  via 
an  equivalent  relation  implied  by  an  automorphism  - 
The  equivalence  relation  combines  those  elements  of 
3.  x  Y^  which  need  not  be  distinguished  in  the  future 

as  far  as  the  core  dynamics  are  concerned.  The  entire 
sequence  of  these  equivalence  relations  mao  every  ob¬ 
servation  sequence  y^  into  some  element  of  Z^lt),  and 

thus  dictate  the  structure  of  a  finite  state  machine 
with  V  as  an  input  set  and  Z^ (t)  as  the  states. 

Definition:  The  local  core  observer  for  agent  i  is 
the  system  with  state  set  Z. (t)  and  dynamics 


zi(t+l) 


fi(zi(t),  yi(t+l)) 
$1(zi(t),  y.(t+l)) 


(4.25) 


where  the  Z. (t)  are  the  components  on  which  R*(t)  is 
defined,  an4  <p.  is  a  component  of  the  reducing 
automorphism  used  to  reduce  F(R*(t))  to  R*(t+1). 

Thus  the  core  wnamics  define  some  automata  for 
processing  local  observations  in  a  way  which  maintains 
the  relationship  described  in  R*. 
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V.  Decentralized  Estimation 


A.  Problem 


The  general  decentralized  estimation  problem  is  to 
find  the 


from  the  definition  of  a  homomorphism.  The  y  defi¬ 
ned  in  (4.9),  must  achieve  a  value  no  larger  than 
J*(R'),  and  the  best  y  is  at  least  as  good  as  this 
one,  so  the  conclusion  holds.  This  immediately 
suggests  the  third  major  result: 

Theorem  3 :  bet  R(t)  be  a  primitive  information  rela¬ 
tion  and  R*  (t)  its  core. 


..  .  r  \ 


min  max 


_T  -T  t€  1,...,T  x(C)possible 
1  2 


J(x(t)  , Uj. ( t)  «u2(t) ) 


where  x(t)  is  possible  if  x(t)  6  f_  (>'Q) ,  (the  t-fo.'.d 
composition  of  with  itself).  The  information 
restriction 

u^ti  -  Yi(t)  (y£)  l5-2 

still  applies. 


J(Y,  (t)  ,y.(t)  -  max  J(x(t),Y,  (x^,Y,(yb) 
1  2  x(t)\ 


'1  /possible 

t  I 


Each  component  of  IV  may  be  chosen  separately,  since 

decisions  do  not  affect  dynamics  or  costs  other  than 
that  a  single  time;  J  depends  only  on  decision  rules 
at  one  time.  Thus 

lemma  5:  The  solution  to  the  decentralized  estimation 
problem  may  be  found  by  salving  the  sequence: 

minimize  J  (y,  (t) . y  (t) )  (5.4) 

Y1(t),Y2(t) 

B.  Use  of  the  Information  Relation 

Definition:  The  optimal  value  of  an  information 
relation  R  in  a  nondeterministic  decentralized 
estimation  problem,  denoted  J* (R) ,  is 

min  max  Jlx.'Y^U^)  •  Y2  <z2 J  J  (5.5) 

YiY2  t1€21 


J* (R(t) )  -  J* (R* (t) )  (5.11) 

Proof:  From  theorem  1,  »(t)S8*.  Applying  Lemma  6  in 
both  directions,  the  conclusion  follows. 

VI .  The  Control  Prob.cm. 

A.  Information  Dynamics 

The  definitions  of  information  relation,  homomor- 
phisms,  containment,  and  cores  established  in 
section  IV  carry  over  to  the  control  problem  without 
change.  The  expansion  process  is  the  only  place  where 
new  information  relations  ire  generated,  so  the 
influence  of  decisions  on  dynamics  requires  a  modifi¬ 
cation  there. 

Definition:  The  expansion  of  an  information  relation  R 
by  dec'  sion  rules  y^  :Z^  -*  IL  is  an  information 

relation  R',  where 

R*  -  F(R,Y1.Y2>  <61> 

k'  :  Z^  X  Zj  -*■  2X  (6.2) 

Z‘  »  Z  x  V.  (6.3) 

ill 

R‘(lzlyl) 'U2,y2M  “  f(R(*1.*2).Y1«*1)Y2(2i)) 

16 -4) 

Definition:  Let  $  be  a  homomorphism  from  R  to  R* . 
Then  a  decision  rule  :  Z -  U£  is  contained  in 

yi  “  zi  *  °i’  denoted  • 


Yi(*i>  *  Yi  (♦i  t*i>  > 


V  zi  e  Zi 


x6R(z1>z2) 

with  Z2  and  Z2  the  sets  on  which  R  is  defined,  and 
the  restriction. 

y.  :  Z.  ♦  u.  (5.6) 

i  i  i 

Lemma  6:  If  R  and  R1  are  information  relations,  then 
RC  R'z3>  J*(R)  <  J*(RM  (5.7) 

Proof:  If  RC#',  then  a  homomorphism  t  -*  (<^2_»'1>2) 

exists  from  R  to  #’.  Consider  any  strategy 

Y1  -  (Y2<Y2)  with 


Build  a  strategy  y-  ^  ,y2>  with 

Ya  <*i*  “  (j  ($£(*>> 

Then  the  inequality  stems  from  the  fact  that 


With  this  notion,  the  results  of  section  IV  gene¬ 
ralize  to  the  control  case  as: 

Theorem  4:  Let  R  and  R’  be  information  relations,  and 
y .  ,y'2  decision  rules  of  the  appropriate  structure. 

a)  if  R  is  a  primitive  information  relation,  then 
so  is  F (r,y27y2> 

b)  if  RCR1,  Y:OT2  and  Y2CY2>  all  by  the  3ame 
homomorphism,  then 

f(r,yx,y2)<c;  f(R’,y^,y2)  l6-6) 

o)  If  R  is  a  primitive  information  relation,  then 
core  (R(R,y, ,y_)  *  core  (F(core(R) ,y. »Y-) ) 

12  12  (6.7) 

Proof:  All  procfs  are  direct  extension  of 

a)  Lemma  3 

b)  Lemma  4 

c)  Theorem  2,  using  (a)  and  (b) . 


RUj^Zj)^  R’  (z, )  ,$2(z2)) 


A 

\ 
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B.  Costs 

Properties  developed  in  Section  IV  for  the  estima¬ 
tion  problem  also  generalize  to  the  control  problem. 

Definition:  The  cost  of  an  information  relation  R  wi.h 
compatible  decision  rules  Y^»Y2  *-s 

=  max  J  (x,y  (z^)  (z^ )  (6.8) 

2i 

2., 

x6R(.:1#z2) 

The  overall  problem  objective  (2.13)  becomes 

JIT  ,T  )  *  max  max  J (x(t) ,Y  (y5) #Y0 (y^) ) 

1  2  te{l,...,T)}  t  1122 

h 

t  (6.9) 

V2 

xetRfy^y*) 

=  max  J(R(t),y  (t),y  (t))  (6.10) 

teil, . . . ,tJ 

where  the  dependence  of  R(t)  on  prior  decision  rules 
is  left  implicit.  Thus  the  overall  objective  can  be 
written  in  terms  of  costs  of  primitive  relations  with 
decision  rules. 

The  final  result  needed  is: 

Theorem  5:  Let  R  and  R' ,  be  information  relations, 
with  Y^>Y2  atKi  Yj^Yj  compatible  decision  rules. 

Then 

a)  If  rCI  R'  ,  '(jC  Y|<  end  yn<£Zy'2  by  the  same 
homomorphism  4*,  then 

j(r,y1.y2)  i  hr',)].  y'2)  (6.ii) 

b)  If  K*  =  core  (R'KYjS'lP  and  Y-,C>2  bY  thc 
same  homomorphism  4,  then 

J ( R*’,  ^ , Y 2 >  =  J(R'  ’ Y p Y p  (6-12) 

Proof :  Also  direct  extensions  of  previous  results. 

a)  Lemma  6 

b)  (a)  with  Theorem  1. 

C.  General  Solution 

Let  R* (t)  -  core  (R(t)),  where  R(t)  is  the 
primitive  information  relation  created  by  decision 
rules  prior  to  cime  t.  From  Theorem  4<:,. 

K*(t+1)  =  core  (F (R* (t) ,y L (t) , {  ,  ( t) ) )  (6.13) 

=  F*(R*(t),Y1(t)/Y2(t) 

and  the  overall  objective  is 

J<r.,r  )  -  max  J(R*  (t)  ,Y,  (t)  ,Y,(t)  (6.14) 

.  1  2  t6(l . T)  1 

whore  V ^  and  are  now  sequences  of  decision  rules 

whose  arguments  are  in  Z^(t)  and  2^(0  respectively. 

Solution  of  tnis  problem  would  involve  straight¬ 
forward  minimax  dynamic  programming  if  the  set  in 
which  the  R* (t)  could  like  were  determinable  ahead  of 
time.  Unfortunately,  it  is  not  known  how  to  do  this  at 
this  point;  however,  ore  may  construct  the  set  of  all 
cores  reachable  under  all  from  R* (o)  and  take 

this  to  be  the  requisite  set. 


242 


AD  P002 


A  MINIMUM  SENSITIVITY  INCENTIVE  CONTROL  APPROACH  TO  TEAM  PROBLEMS* 


Tamer  Bagar,  Jose  B.  Cruz,  Jr.,  and  Derya  Cansever 
Department  of  Electrical  Engineering  and 
Coordinated  Science  Laboratory 
University  of  Illinois 
1101  U.  Springfield  Ave. 

Urbana,  Illinois  11801 


\  ABSTRACT 

Injthis  paper  we  analyzers  class  of  two-agent  team 
de-  ision  problems  with  a  hierarchical  decision  struc¬ 
ture,  wherein  one  of  the  decision  makers  may  have  a 
slightly  different  perception  of  the  overall  team  goal, 
with  this  slight  variation  not  known  by  the  other  agent 
who  is  assumed  to' occupy  the  hierarchically  dominant 
position.  The  leading  agent  has  access  to  dynamic  in¬ 
formation  and  his  role  is  to  announce  such  a  policy 
(incentive  scheme)  which  would  lead  to  achievement  of 
the  overall  team  goal,  in  spite  of  the  slight  varia¬ 
tions  in  the  other  agent's  perception  of  that  goal, 
which  are  not  known  or  predictable  by  him.  We  may  call 
a  policy  with  such  an  additional  feature  a\  "minimum 
sensitivity"  incentive  policy.  We  obtain, \in  the  paper, 
"minimum  sensitivity"  policies  for  the  leading  agent, 
for  a  general  cost  functional  with  convex  structure, 
which  are  least  sensitive  to  variations  in  nhe  following 
agent's  perception  of  the  team  goal.  In  some  special 
cases,  we  show  that  the  robust  feature  of  the  incen¬ 
tive  scheme  is  maintained  regardless  of  the  n^agnitude 
and  nature  of  the  variations,  and  illustrate  the  theory 
with  an  example  arising  in  armament  limitation  and 
control.  \ 

I.  INTRODUCTION 

The  main  characteristic  of  team  decision  problems 
is  the  presence  of  several  decision  makers  with  a 
common  objective  functional  which  is  to  be  optimized 
jointly  (but  possibly  in  a  decentralized  fashion)  by 
all  decision  makers.  An  underlying  stipulation  in 
research  on  team  theory  has  been  the  assumption  that 
all  agents  perceive  the  common  goal  in  exactly  the  same 


Zheng  and  Ba$ar  (1982)],  which  involve  a  hierarchy  in 
decision  making  and  a  suitable  information  structure 
for  the  decision  maker  at  the  top  of  the  hierarchy, 
that  allows  him  to  design  a  policy  which  in  its  turn 
induces  the  other  decision  maker  with  a  different  ob¬ 
jective  functional  to  behave  in  a  desired  manner. 
Recently  in  [Cansever  and  Bagar  (1982)],  optimal  incen¬ 
tive  schemes  have  been  used,  within  the  context  of 
Stackelberg  games,  to  minimize  the  effect  of  changes 
in  the  parameters  of  the  follower's  cost  functional  on 
the  leader's  optimum  cost  value,  by  simultaneously 
achieving  a  desired  goal.  Here,  we  direct  our  atten¬ 
tion  to  problems  which  are  nominally  team,  and  derive 
incentive  schemes  that  are  least  sensitive  to  devia¬ 
tions  in  the  hierarchically  inferior  decision  maker's 
perceptions  of  the  uncertain  parameters.  The  fact  that 
the  underlying  goal  is  common  (that  is,  the  nominal 
optimization  problem  is  a  team  problem — a  property  that 
may  be  destroyed  in  the  decision  process)  can  be  ex¬ 
ploited  to  obtain  very  appealing  minimum  sensitivity 
strategies,  as  we  will  show  in  the  sections  to  follow. 

The  problem  is  formulated  in  Section  II.  In 
Section  III  we  introduce  sensitivity  functions  and 
obtain  robust  affluc  strategies  for  a  general  class  of 
convex  cost  functionals.  In  Section  IV  we  provide  a 
geometrical  interpretation  for  total  insensitivity  when 
the  objective  functional  is  affine  in  the  unknown 
parameter.  Section  V  deals  with  the  generalization  of 
some  of  these  results  to  the  multiparameter  case,  and 
Section  VI  illustrates  the  basic  ideas  developed  in 
this  paper  using  a  model  from  armament  limitation  and 
control.  Concluding  remarks  of  Section  VII  end  the 
paper. 


way,  and  face  exactly  the  same  mathematical  optimiza¬ 
tion  problem  [Marschak  and  Radner  (1972)].  In  this 
paper  we  relax  this  basic  assumption  and  allow  (in  the 
context  of  two-agent  problems)  one  agent  to  have  a 
somewhat  different  perception  of  the  common  goal  and 
to  quantify  it  in  a  slightly  different  way.  Further¬ 
more,  we  will  assume  that  the  other  agent  is  not  in¬ 
formed  of  the  existence  of  this  discrepancy  in  the 
perception  of  the  common  goal,  but  is  able  to  monitor 
the  decision  of  the  former  by  occupying  a  higher 
(dominant)  position  in  the  decision  process.  The 
problem  we  address  to  is  the  design  of  a  suitable  stra¬ 
tegy  for  the  agent  who  occupies  the  hierarchically 
superior  position  and  who  still  adopts  the  original 
team  objective  functional  as  his  own,  such  that  the 
change  in  the  minimum  value  of  the  team  cost  because  of 
the  discrepancy  in  the  perceptions  of  the  common  goal 
is  kept  to  a  minimum.  Ideally,  the  hierarchically 
superior  member  of  the  team  would  seek  not  to  be 
affected  by  this  discrepancy,  if  this  is  at  all 
possible. 

Ve  will  approach  this  problem  using  optimum  incen¬ 
tive  design  schemes  [Ho,  Luh,  and  Olsder  (1982)  and 
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II.  PROBLEM  FORMULATION 

Consider  a  two-person  deterministic  team  decision 
problem  in  normal  form,  described  by  the  cost  func¬ 
tional  .I(Yi  ,v2,o),  where  y^er^  denotes  the  strategy  of 
DM1  (i'th  decision  maker)  and  oCACR  is  a  parameter 
on  which  the  cost  functional  depends.  Let  u€U  - Rn , 
v€V»Rm  denote  the  decision  variables  of  DM1  and  DM2, 
respectively ,  and  assume  that  rj»{y:V-*U},  I^-V;  i.e. 
DM1  has  access  to  the  decision  value  of  DM2.  DM1  also 
knows  the  precise  value  of  the  parameter  a  (say  a°) , 
whereas  DM2  perceives  its  value  differently  (say  a+€A), 
which  in  turn  gives  rise  to  a  different  cost  functional 
from  his  point  of  view,  namely,  J(Yi»Y2»“+)  t  J (Y 1  •  Y2 *a°). 
Furthermore,  DM1  does  not  know  the  exact  value  per¬ 
ceived  by  DM2,  but  his  ultimate  goal  is  to  see  that  the 
lowest  possible  value  is  attained  for  J(yj ,Y2,a0) .  The 
decision  structure  of  the  problem  is  assumed  to  be 
hierarchical,  in  the  sense  that  DM1  is  the  dominant 
decision  maker  and  has  the  power  and  means  of  declaring 
his  policy  in  advance  and  enforcing  it  on  the  other  DM. 
Hence,  while  DM2  is  faced  with  the  problem  of  minimi¬ 
zing  J(yi(v)  ,v,o+)  over  v€V,  DM1  wishes  to  choose  a 
Y  T j  (in  total  Ignorance  of  a+)  that  would  eventually 
lead  tc  a  minimum  value  for  J(yi(v) ,v,a°) . 

By  an  abuse  o 2  notation,  let  J(u,v,a)  denote  the 
cost  functional  on  the  product  space  UxV,  for  each 
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a€A,  and  assume  Chat  Chis  functional  la  strictly  con¬ 
vex  on  U*V,  for  each  iCA,  la  twice  continuously  dif¬ 
ferentiable  In  Its  first  two  arguments  and  continuously 
differentiable  In  Its  third  argument.  Furthermore,  let 
us  denote  the  unique  minimum  of  J(u,v,a°)  by 
(ut,vt)C  B*V.  Restricting  DM1  to  affine  policies  in 
Tj,  we  first  note  that  the  policy 


YjCv)  -  ut  +  P(v-vt) 


where  P  is  an  (n*m)-matrix,  has  the  appealing  pro¬ 
perty  that  if  DM2 'a  perception  of  a  Is  a0 ,  then 
min  J(yi(v),v,o°)  leads  to  the  desired  value  vc£V  for 

any  matrix  P.  If  a+ 1*  a° ,  however,  the  problem  ceases 
to  be  a  cooperative  one  since  the  problem  faced  by  DM2 

la 

min  J(y. (v) ,v,a+)  (2a) 

vCV  1 

whose  minimizing  solution  (say  v+£  V)  satisfies  (and  la 
uniquely  determined  by)  the  equation 

Ju(u+,v+,a+)P  +  Jv(u+,v+,a+)  «0  (2b)15 

where  u+  ■  T j  (v*-)  •  u*  +  P (v+-ve)  and  is  not  necessarily 
the  same  as  uc .  The  problem  we  address.  In  the  sequel, 
is  whether  it  Is  possible  to  choose  a  robust  policy  Yj 
(by  choosing  P  appropriately)  so  that  either  u+«uc  and 
v+“  vl,  or  the  discrepancies  will  be  small  whenever  a** 
la  close  to  a°;  in  other  words,  we  seek  either  total 
Insensitivity  or  minimum  sensitivity  of  the  optimum 
value  of  J(u,v,a°)  to  variations  In  the  perception  of 
DM2  (of  a)  by  a  proper  choice  of  y j. 

III.  INTRODUCTION  OF  A  SENSITIVITY  FUNCTION  AND 
DERIVATION  OF  MINIMUM  SENSITIVITY  SOLUTIONS 

Aa  a  measure  of  the  sensitivity  of  J(u,v,a°)  with 
respect  to  deviations  in  the  perception  of  DM2  of  a 
from  Its  nominal  value  a°,  let  us  introduce  the  total 
derivative  of  J(u,v,a°)  with  respect  to  a,  when  u  •  u+, 
v  ■  v+.  satisfying  (2b),  and  at  the  point  a+«  o°.  We 
call  this  function  the  first-order  sensitivity  function 
of  J(u,v,o°)  with  respect  to  a,  at  a«a°,  in  view  of 
(1)  and  the  optimal  response  of  DM2  as  characterized 
(uniquely)  by  (2b) : 

lj<<x°)  - dJ(u+,v+,a°)/da+|o+_ao 

«  (dj/dv+) (dv+/da+) |a+„ao 

•  IJ-(«t.vt.«0)P  +  Jv(*t.Tt,a°)]v*,  (3; 

where 

v£ ■  dv+(a+) /do+)  . 
a  +0 

'a  *a 

and  Is  determined  from  (2b).  To  obtain  an  expression 
for  v£,  we  note  that  (2b)  is  in  fact  an  identity  for 
all  a+€A,  since  it  uniquely  determines  the  optimal 
response  of  DM2  to  the  announced  policy  (1)  of  DM1, 
with  his  perceived  value  for  a  being  a+.  Hence,  dif¬ 
ferentiating  (2b)  with  respect  to  a  ,  and  evaluating 
the  resulting  expression  at  a+  *  a° ,  we  obtain 

[P*J  P+P'J  +J  P  +  J  )vC  +  [J  P  +  J  ]  »  0  (4) 

uu  vu  uv  w  a  1  ou  av  '  ' 

whereby 

v£  -  -(P’J  P  +  P'J  +J  P  +  J  P  +  J  ]  (5) 

a  uu  vu  uv  w  au  av 


'Here  Ju  and  Jv  are  tow  vectors  of  dimensions  l*n  and 
l*m,  respectively,  denoting  the  partial  derivatives 
with  respect  to  the  corresponding  decision  variables. 


where  the  arguments  ate  evaluated  at  j"u£,  v  ■  vC, 
a*  a0.  Note  that  the  required  lnver8e  in  (S)  exists 
under  the  initial  hypothesis  that  J  Is  strictly  convex 
in  (u,v)  for  all  a£A. 

Now,  since  the  pair  (uC,vC)  globally  minimizes 
J(u,v,a°) ,  we  already  know  that 


Ju(ut.vt,a°)  -0, 


Jv(ut,vt,a°)  ■  0, 


in  view  of  which  the  first  product  term  of  (3)  and 
hence  Ij(u°)  vanish.  Then,  the  dominating  term  in  the 
Taylor  expansion  of  J(u+,v+,a°)  around  a+“a°  Is  deter¬ 
mined  by  the  second-order  sensitivity  function: 

2. 

I,(a°)»d2J(u+,v+,a°)/da+  +  o 

Z  (a  *a 

2 

«  { (dv+/da+/(d2J/dv+  ) (cv+/da+) 

2 

+  (dJ/dv+) (d2v+/da+  )]  +  g 
i  ci 

-vC  [P’J  P+P'J  +J  P+J  )v£+[JP+J  Jv6  .  (7) 

a  uu  vu  uv  w  a  u  v  aa 

Since  the  second  term  is  zero.  In  view  of  (6),  ^(a0) 
vanishes  If  and  only  If  v£»0,  which  requires  from  (S) 
that  there  exist  a  P  satisfying 


J  (uC,vC,a°)P  + J  (ut,vt,a°)»0 
au  '  av 


A  sufficient  condition  for  this  is,  of  course, 

Jau<“t.vt.“°>  *  0.  O) 

which  is  also  necessary  if  the  second  term  in  (8)  does 
not  vanish  (at  least  one  component  is  nonzero) . 

When  v£  vanishes,  not  only  the  second-order  sen¬ 
sitivity  function,  but  also  the  third-order  sensitivity 
function 

Ij(n°)  ■  d3J(u+,v+,a°)/da+  I  +  g  (10) 

a  «a 

vanishes,  because  it  carries  (by  chain  rule  of  dif¬ 
ferentiation)  only  terms  thet  involve  either  vj  or 
fJuP  +  JvI  aa  products.  Hence,  under  the  condition 
that  (8)  admits  at  least  one  solution,  and  when  DM1 
employs  the  corresponding  policy  (1),  if  DM2’s  percep¬ 
tion  a+  (of  a)  stays  within  an  e-neighborhood  of  its 
nominal  value  a°,  the  3rd  order  Taylor  approximation 
of  the  effect  of  this  discrepancy  is  zero.  We  now 
summarize  this  appealing  feature  of  the  linear  policy 
(1)  in  the  following  proposition. 

Proposition  1:  Let  condition  (9)  be  satisfied,  and 
let  P*  denote  a  solution  to  (8).  Then,  if  DM1  employs 
the  policy 

yJ(v)  -  ut+P*(v-vt),  (11a) 

and  the  unique  minimizing  decision  of  DM2  Is 

v+(a+)  •  arg  min  J(y*(v)  ,v,a+>  ,  (lib) 


J(Y*(v+) ,v+,a°)  agrees  with  J(ut,vt,a°)  to  third  order 
in  a+  when  it  lies  it  a  sufficiently  small  neighbor¬ 
hood  of  ac.  Equivalently,  the  discrepancy  in  costs  is 
of  fourth  order.  n 

When  the  objective  function  J(u,v,a)  is  affine  in 
a, we  can  obtain  more  explicit  results.  Specifically, 
let 

J(u,v,a)  »  g(u,v)  +  ah(u,v)  (12a) 
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vh*re  g  and  h  are  continuously  differentiable  in  their 
arguments, 

hu(ut.vt)  ,*  0,  (12b) 

and  J  is  strictly  convex  In  (u,v)  for  all  a£A.  Then, 
(8)  reads 

hu(ut,vt)P  +  hv(ut,vt)  -  0,  (13) 

a  solution  to  which  always  exists  because  (12b)  becomes 
equivalent  to  (9).  Hence  vj,  as  given  by  (5)  [evaluated 
at  o«a°]  is  zero.  This,  in  turn,  implies  through  an 
iterative  verification  that  the  vector  dnv+(a+)/da+n, 
where  v+(a+)  is  given  by  (lib),  vanishes  at  a+«a°,  for 
all  nsl,2,...,  simply  because  the  second  term  in  (4) 

J  P  +  J  «  h  (u,v)P  +  h  (u,v) 
au  av  u  V 

is  not  explicitly  dependent  on  a.  Since  the  nth  order 
sensitivity  function 

In(a°)  «  dnJ(u+.v+,a°)/da+  I  +  g  (14) 

'  a  «a 

14  1  4^ 

carries  only  (d  v  (a  )/da  )  ,  i“l,2,...,n  as  pro- 

a  “a 

duct  terms,  which  are  all  zero  whenever  P  is  chosen  to 
satisfy  (13),  it  follows  that  sensitivity  functions  of 
all  orders  vanish,  at  a+”a°.  Hence, 

Proposition  2;  When  the  objective  function  is  given  by 
(12a),  under  the  condition  (12b),  let  P*  be  any  solu¬ 
tion  of  (13).  Then,  if  (11a)  is  employed  by  DM1,  the 
response  of  DM2  (i-e.,  (lib))  is  independent  of  a+, 
and  v+“vc.  Hence,  J(u+,v+,a°)  *  J(ut,vt,a°)  for  all 
o^A,  that  is  the  overall  performance  is  independent 
of  the  perception  of  DM2  regarding  the  value  of  a.  □ 

In  the  next  section  we  provide  a  geometric  inter¬ 
pretation  of  this  appealing  feature  of  the  linear 
policy  when  the  cost  function  is  an  affine  function  of 
the  parameter  a. 

IV.  GEOMETRIC  INTERPRETATION  OF  TOTAL  INSENSITIVITY 
WHEN  TIE  OBJECTIVE  FUNCTIONAL  IS 
AFFINE  IN  A  PARAMETER 

Let  the  objective  function  J  be  as  given  by  (12a) 
with  h  satisfying  condition  (12b).  Since  J  is  strictly 
convex,  the  team  solution  (ut,vt)  when  a  *  a°  is 
obtained  (uniquely)  from 

gu(ut,vt)  +  a0h(j(ut,vt)  -  0  (15a) 

g^OtSv*)  +  a°hv(ue, vt)  -  0.  (15b) 

Postmultlplylng  (15a)  by  P,  adding  this  to  (15b),  and 
taking  the  transpose,  we  have 

(P’g'+g')  +  a°(P'h’+h’)  -  0.  (16) 

U  V  U  V 

Pictorlally,  the  vectors  (P'g^  +  g^)  and  (P'h^  +  hy)  are 
oppositely  oriented  when  a0  is  a  positive  scalar. 
Clearly,  o°  is  the  ratio  of  the  magnitude  of  the  vector 
(P'g^  +  g^)  to  the  magnitude  of  the  vector  (P'h^  +  h^). 

If  DM1  chooses  P  such  that  (13)  is  satisfied,  then  the 
magnitudes  of  both  vectors  become  zero.  In  this  case, 
if  a°  is  replaced  by  af  a°  in  (16),  the  equation  would 
still  hold,  and  (uc,vc)  satisfies 

CP*«J  +  IV>  +  a(P'V+V)  -  0.  (17) 

Since  (17)  is  the  condition  used  by  DM2  to  optimize  v 
(see  also  (2b)),  he  will  choose  v»vc,  no  matter  what 
his  perceived  value  of  a  is.  Thus,  DM1  achieves  the 


team-optimal  solution  for  minimizing  J(u,v,a°)  by 
choosing  P  such  that  P'  transforms  the  vector  hJ  to 
(-H.')  : 

P'h^u'.v*)  -  -V (uW  (18a) 

This  same  choice  of  P'  transforms  g^  to  (-g^) : 

P’g^u'.v1)  -  — g^fu'.v*).  (18b) 

V.  EXTENSION  TO  THE  MULTIPARAMETER  CASE 

In  the  previous  sections,  we  have  restricted  our 
discussion  tn  the  case  aCACR.  When  AC Rr,  the  first- 
order  sensitivity  function  Ij(a°)  becomes  a  (l*r) 
vector  given  by 


tj(a°)  -  dJ(u+,v+,a°)/da+|  +  g 

a  “u 

»  [Ju(ut,vt,a°)P  +  Jw(ut,vt,o°)]v^ 

(19a) 

where 

v1  -  -[P'J  P+P'J  +J  P+J  )-1lJ  P-KI  ] 
a  1  uu  vu  uv  vvJ  *  ua  va 

(19b) 

t  t  o 

and  the  arguments  are  evaluated  it  u*»  ,  v •  v  ,  a«a  . 
Note  that  lj(a°)"0,  in  view  of  (6).  Furthermore, 
v£ » 0  (zero  matrix)  if 

Jva  (u£ ,  v  C .  o°)  C  14(  Jua  (uC ,  v\a°)  )  (20) 

(where  R>  denotes  the  range)  since  then  it  is  possible 
to  find  an  (n*m)  matrix  P  to  make  the  second  product 
term  of  (19b)  zero.  In  this  case,  sensitivity  func¬ 
tions  of  orders  1,  2,  and  3  vanish  at  the  nominal  solu¬ 
tion  point;  hence,  affine  policies  have  very  appealing 
sensitivity  properties  also  in  the  multlparameter  case. 
When  condition  (20)  is  not  satisfied,  however,  one  has 
to  minimize  a  suitable  norm  of  the  leading  sensitivity 
function  with  respect  to  the  (nxm)  matrix  P.  This  ■’a, 
in  general,  the  second-order  sensitivity  functiona' 
l2(a°)  which  is  an  (r*r)  nonnegative  definite  mat  . lx. 

A  suitable  norm  for  minimization  is,  in  this  case, 

Tr{ I 2(0°) ! .  We  are  now  faced  with  an  unconstrained 
optimization  problem  on  P,  for  which  a  closed-form 
solution  does  not  in  general  exist;  however,  numeri¬ 
cally  it  is  a  feasible  problem. 

When  the  objective  function  J  is  affine  in  the 
parameter  vector  o£Rr,  a  total  insensitivity  result 
could  be  established  under  certain  conditions,  by  a 
direct  extension  of  the  discussion  of  Section  IT. 
Towards  this  end,  let 

J(u,v,a)  -  g(u,v)  +  a’h(u.v)  (21) 

where  g  :  UxV  +  R,  h  :  U*V-‘Kt,  J  is  strictly  convex  and 
continuously  differentiable  in  (u,v).  Then,  the  opti¬ 
mality  conditions  for  a  •a0  are 

gu(ut,vt)  +  a°'hi(ut,vt)  -  0  (22a) 

gv(ut,vt)  +  a0  hy(ut,vt)  -  0  (22b) 

where  hu  (respectively,  hy)  is  an  rxn  (respectively, 
rxm)  matrix.  The  optimal  response  of  DM2,  under  the 
policy  (1)  for  DM1,  Is  determined  uniquely  from  (for  a 
general  ct) 


r 


/ 


245 


\ 


•  / 


where  subscript  i  denotes  the  i'th  component  of  the 
corresponding  vector.  Now,  let  us  assume  that  there 
exists  an  m*n  matrix  P  satisfying  simultaneously 

hiu(ut,vt)P  +  h^fu^vS  -  0,  i«l . r.  (24) 

Under  this  condition,  the  second  term  in  (23)  vanishes 
ct  v«vl,  for  all  a£ACRr,  and  furthermore  the  first 
term  also  vanishes  in  view  of  (22a) -(22b),  by  basically 
following  the  argument  of  Section  IV.  Hence,  under 
this  particulav  choice  of  P,  v « vC  is  the  unique  solu¬ 
tion  to  (23)  far  all  values  of  a;  that  is,  the  optimal 
response  of  TM2  is  independent  of  his  perception  of 
the  value  of  a,  provided  that  strict  convexity  of  J  is 
preserved.  To  summarize, 

Proposition  3!  When  the  objective  function  is  given 
by  (21),  let  there  exist  a  solution  to  (24),  to  be 
denoted  P*.  Then,  if  the  policy 

y*(v)  ■  uC  +  P*(v-vC) 

is  employed  by  DM1,  the  response  of  DM2  (which  is  (lib) 
with  a+£  Rr)  is  independent  of  a+,  and  v+«v£. 
Consequently,  J(u+,v+,a°)  » J(ut,vt,a°)  for  all 
a+€AC«r.  □ 

In  the  next  section  we  consider  an  example  that 
Involves  arms  race  between  two  nations,  which  serves 
to  illustrate  some  of  the  ideas  generated  in  this  and 
the  previous  sections.  Another  example  from  micro¬ 
economics  can  be  found  in  [Cansever,  Bajar,  and  Cruz 
(1983)]. 

VI.  AN  EXAMPLE  FROM  THE  PROBLEM  OF  ARMAMENT 
LIMITATION  AND  CONTROL 


In  their  papers  on  armament  race  and  control 
[Simaan  and  Cruz  (1975a)  and  Slmaan  and  Cruz  (1975b)], 
Simaan  and  Cruz  have  modeled  the  arms  race  problem  as 
a  noncooperative  differential  game  between  two  nations. 
A  salient  feature  of  this  model  is  that,  when  the 
respective  cost  functionals  are  taken  to  be  quadratic 
in  the  decision  variables,  the  resulting  optimal  state 
trajectory  yields  a  discretized  version  of  the  armament 
model  proposed  earlier  by  Richardson  (Richardson  (I960)]. 
We  will  consider  here  the  case  when  the  two  nations,  DM1 
ard  DM2,  have  agreed  to  reduce  their  respective  armament 
expenditures.  Such  a  situation  inevitably  requires  the 
presence  of  an  element  of  cooperation  between  DM1  and 
DM2,  since  any  significant  departure  from  the  armament 
level  jointly  agreed  upon  may  eventually  lead  to  the 
original  high  armament  expenditure.  Towards  the  formu¬ 
lation  of  this  problem,  let  us  assume  that  the  goals 
of  the  Dll's  can  be  represented  by  two  objective  func¬ 
tionals  J^fx; ,X2>Uj,U2)  ,  i”l,2,  wherein  DMi  aims  to 
optimize  Jj.  In  order  to  incorporate  the  cooperation 
element  discussed  above,  we  will  adopt  the  Pareto 
optimal  equilibrium  concept,  which  will  be  realized 
[Schmitendorf  and  Leitmann  (1974)]  if  Che  DM's  jointly 
optimize 

J(xl,x2,ul,u2)  -V1(V*2.u1.u2)  +k2J2(x1,x2,u1,u2) 

(25) 

where  k1€  R+;  u€R+,  and  u,£ll+  denote  DMI  and  DM2's?) 
armament  investments,  respectively,  and  x^  represents 
the  armament  level  of  DMi,  1*1,2,  which  further 
satisfies 

*i  "  fi(xi0,ui)'  (26) 

2)  ”  " 

R+  denot->  the  positive  real  line. 


.  /■ 


where  f^  :  R+*R+-*'R+  Is  a  continuous  function  of  x^0 
and  u^,  and  is  strictly  increasing  in  Its  second  argu¬ 
ment.  Here,  x^Q  denotes  the  initial  armament  level  of 
DMi. 

In  order  to  obtain  some  explicit  results,  let  us 
adopt  the  quadratic  objective  functional  model  proposed 
by  Simaan  and  Cruz  [Simaan  and  Cruz  (1975a)  and  Simaan 
and  Cruz  (1975b)],  because  of  its  analytical  tact- 
ability  and  other  appealing  features  in  relation  with 
other  existing  models;  namely,  let 

Jl^Xl’X2,ul,U''^  *  1  fRi(u1"wi)2  +  Q|VXj-Six^-v1)2}(27) 
and 

xl‘fl(xio*ui>“8ixio*V  j»l,2;  if*j  (28) 

where 

Ri>°,  Q°  >  0,  Si>0,  0<6i<l,  i-1,2. 

Here,  x  denotes  the  given  initial  armamznt  level  of 
DMi  and  expression  (27)  reveals  the  fact  that  each  DM 
wants  to  reduce  the  gap  that  exists  between  his  arma¬ 
ment  level  and  a  linear  function  of  the  other  DM's 
armament  level,  and  at  the  same  time  wishes  to  minimize 
his  expenditure.  IWe  refer  to  [Simaan  and  Cruz  (1975b)] 
for  an  elaborated ! interpretation  of  (27).  Under  this 
set  up,  there  exidts  a  unique  pajLr  (uj,u£) ,  minimizing 

J(x,,x,,u ,  ,u,)  as  a  function  of  which  corre- 

lilt  j  ^ 

sponds  to  the  pair  (ut,vt)  in  the  general  discussion  of 
Sections  II  and  Vi 

As  it  may  bejthe  case,  one  of  the  DM's,  say  DM2, 
may  deviate  from  u2>  The  reason  behind  such  a  move 
may  be  that  DM2  totally  Ignores  the  cooperation,  and 
minimizes  his  own  objective  functional.  Assuming  that 
each  DM  can  monitor  the  decisions  of  his  adversary, 
this  situation  would  immediately  give  rise  to  a  Nash 
equilibrium  with  high  armament  expenditures.  Since  we 
have  assumed  that!  each  DM  desires  to  reduce  his  expen¬ 
ditures  while  maintaining  a  certain  balance  of  powers, 
such  a  unilateral!  and  large  deviation  will  be  unlikely. 
In  its  stead,  we  Will  assume  that  DM2  may  have  an 
incentive  to  perform  a  relatively  small  deviation  from 
the  Pareto  equilibrium  point,  being  motivated  by  one  of 
the  following  thr'ee  considerations: 

i)  DM2  may  decide  to  promote  his  relative  impor¬ 
tance  in  the  agreement,  which  is  reflected  by  an 
increase  in  the  value  of  a  from  a0  to  a+,  without 
Informing  DMI,  while  DMI  still  uses  the  value  a°  in 
his  objective  functional; 

ii)  DM2  may  develop  a  different  perception  of  the 
values  of  one  or  more  coefficients  in  the  team  objec¬ 
tive  functional  without  informing  DMI.  Let  us  assume, 
for  instance,  that  DM2  has  decided  to  place  higher 
priority  and  emphasis  on  reducing  the  gap  between  his 
armament  level  and  the  linear  functional  of  DMl's  arma¬ 
ment  level  than  on  minimizing  his  expenditure;  more 
precisely,  that  he  has  decided  to  increase  the  value 

of  Q2  from  to  Q*. 

Hi)  Both  i)  and  ii)  may  be  present. 

We  now  analyze  these  three  cases  separately. 

Case  (ii .  This  is  similar  to  the  analysis  of  Section 
IV.  The  optimal  strategy  for  DMI,  which  leads  to 
H  ,u|)  as  final  outcome,  independent  of  possible 
deviations  in  DY2's  perception  of  t»°,  is  given  by 


t 

i 
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Yli>(u2)  "  uX  +  P<t)(u2"u2)  (29) 

where 

p(i)  S2Q2tS2x2O:'urS2(Blx10'Hll)'v2)  1  (3Q) 

R2(u2“w2)'H32(82x20‘Hl2'S2(6lx10'Hll)‘v2) 


Case  (ii).  The  solution  here  again  follows  from  the 
analysis  of  Section  IV.  Hence,  there  exists  an  optimum 
insensitivity  strategy  realizing  the  team  solution 
independent  of  DM2's  different  perceptions  of  Q3,  and 
such  a  strategy  is  given  by 


y1U)<u2> 


ul  +  s77  <VU2>- 


(31) 


Case  (Hi).  This  case  involves  multiparameters  where 
condition  (20)  is  not  satisfied.  Hence,  within  the 
class  of  affine  policies,  there  does  not  exist  any 
element  which  makes  the  cost  of  DM1  completely  insen¬ 
sitive  to  discrepancies  in  DM2's  perceptions  in  more 
than  one  parameter.  In  order  to  overcome  this  dif¬ 
ficulty,  we  adopt,  as  discussed  in  Section  V,  the 
scalarized  sensitivity  function  Tr{ ^(o^.Q-j) ) •  and 
minimize  it  subject  to  the  constraint  that  the  stra¬ 
tegy  of  DM!  is  given  by 


Y(iii> 

Y1 


<“2> 


t  (iii) ,  C. 
Jj  +  P'  '(u2-u2). 


(32) 


This  problem  can  be  shown  to  admit  a  unique  solution 
which  can  oe  obtained  explicitly.  Hence,  when  DM1  is 
uncertain  about  DM2's  perception  of  both  a  and 
there  still  exists  an  affine  strategy  which  minimizes 
an  appropriate  scalar  function  representing  the  sensi¬ 
tivity  of  DM1' s  incurred  co  it  with  respect  to  devia¬ 
tions  in  these  coefficients  from  their  nominal  values, 
and  such  a  strategy  is  given  by  (32) . 

In  the  preceding  analysis,  P ^  is  the  same  coef¬ 
ficient  as  DM1  would  have  used  in  his  strategy  in  a 
Stackelberg  game  with  DM2  being  the  follower  and  DM1 
enforcing  the  point  (uj.u^).  On  the  other  hand,  in 
case  (ii) ,  by  announcing  a  strategy  of  the  form  (31), 
DM1  makes  DM2's  objective  functional  independent  of 
the  uncertain  coefficient  Q2.  Therefore,  DM2's  dis¬ 
crepancies  do  not  affect  the  team  solution  anymore. 
However,  when  the  number  of  uncertain  coefficients  is 
large  as  compared  with  the  dimension  of  DMs'  decision 
vectors,  there  still  exists  a  compromise,  which  is  to 
minimize  the  cumulative  effect  of  variations  of  uncer¬ 
tain  parameters  around  their  nominal  values: 
y(iii)(u2)  la  designed  to  perform  such  a  compromise. 

VII.  CONCLUDING  REMARKS 


In  this  paper  we  have  introduced  the  notion  of 
optimum  minimum  sensitivity  incentive  policies  in  team 
decision  problems  wherein  one  member  of  the  team  has  a 
somewhat  different  perception  of  the  common  goal  than 
the  other  one,  and  va  have  derived  explicit  incentive 
policies  which  render  the  incurred  value  of  the  team 
objective  functional  least  sensitive  to,  and  in  some 
cases  even  independent  of,  the  discrepancies  described 
above. 


of  his  subordinates.  There  is  an  underlying  goal,  or 
objective,  which  involves  a  successful  completion  of 
a  mission  o'-  task  (such  as  multi-object  tracking  and 
fire  control),  and  th's  goal  is  determiend  by  the  DM's 
at  the  top  of  the  hierarchy  in  rather  general  terms 
(l.e.,  not  in  fine  detail),  which  is  then  transmitted 
to  the  relevant  DM's  at  the  lower  levels. 

Hence,  in  a  general  framework,  a  C2  system 
involves  a  team  of  DM's  who  act  in  an  uncertain 
environment,  and  who  have  limitations  on  control  and 
communication  capabilities.  However,  realistically, 
this  is  not  strictly  a  team  problem,  because,  in  an 
uncertain  environment,  it  is  unlikely  that  every  DM 
will  develop  precisely  the  same  percention  of  the 
ultimate  goal  in  every  fine  detail.  In  fact,  in  order 
to  model  systems  as  team  problems,  it  is  absolutely 
necessary  that  all  DM's  have  exactly  the  same  percep¬ 
tion  of  an.  existing  cotrmon  goal  and  quantify  this 
perception  in  exactly  the  same  way.  Any  discrepancy 
thit  exists  between  the  perceptions  of  the  DM's  on  the 
underlying  common  goal  will  laad  to  a  decision  problem 
which  cannot  be  treated  as  a  team  problem,  ar.d  optimal 
decision  rules  derived  by  totally  ignoring  (or  over¬ 
looking)  this  aspect  of  the  problem  are  apt  to  lead 
to  outcomes  which  are  extremely  sensitive  even  to  small 
variations  in  the  perceptions  with  regard  to  real 
underlying  goals  of  the  mission.  The  approach  devel¬ 
oped  in  this  paper  remedies  this  deficiency  because  it 
takes  into  account  the  possibility  that  the  DMs’  per¬ 
ceptions  of  the  "team  goal"  may  deviate  from  the 
nominal  set  by  the  highest  level  decision  making  unit. 

Two  possible  extensions  of  the  general  approach, 
of  this  paper  are  to  dynamic  multi-stage  decision 
problems  and  to  stochastic  team  problems.  In  the 
latter  case  a  natural  source  of  discrepancy  is  the 
a  priori  statistical  information  which  is  normally 
assumed  to  be  shared  by  the  DM's.  A  recent  reference 
[Bajar  (1983)]  addresses  the  question  of  existence  of 
suiLable  equilibrium  solutions  for  such  problems  when 
there  is  discrepancy  in  the  subjective  probability 
measures  characterizing  the  probability  space. 
Derivation  of  minimum  sensitivity  incentive  policies 
in  this  context  is  currently  under  study. 
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Abstract  A  hierarchical  decision  problem  with  one 
leader  aid  a  continiiura  of  followers  i  investigated. 
The  prjblem  Is  formula.  as  a  stochastic  Stackelberg 
gme  with  nonnested  information  structure,  and  studied 
by  using  the  Inducible  Region  concept.  For  the 

single-stage  case,  weVshow  that  the  inducible  region 
can  be  delineated,  and  the  optimal  Stackelberg 
strategy  can  be  constructed.  These  results  are  then 
extended  to  the  tuo-stage  case.  Although  the  problem 
is  formulated  In  a  pricing  context  in  terms  of 
companies  and  customers,  the  formulation  can  be 
interpreted  differently  to  model  0,  related  problems. r- 

I  ' N 

2  Introduction  /  j 

1*1  Motivation  •-  *  T»  >f‘~  $ 

Tne  leader-follower  type  of  problems,  also  known  as 
the  Stackelberg  games,  deal  with  multi  -pweson 
hlerarclcal  decision  problems.  leclslon  makers  in 
such  a  problem  belong  to  different  levels  of  a 
hierarchy  and  have  asymmetric  roles.  A  higher  level 
decision  maker  (a  leader)  has  the  authority  to 
announce  his  strategy  ar.d  impose  it  on  the  decision 
makers  under  him  (the  followers).  The  follower:., 
knowing  the  leader's  strategy,  are  assumed  to  react 
rationally  to  minimize  their  own  cost  functions.  The 
leader's  objective  is  then  to  design  his  strategy,  by 
taking  into  account  the  followers'  reactions,  to 
minimize  his  cost.  Since  our  society  is  essentially 
hierarchical  in  structure,  there  exist  ample  instances 
which  can  be  modeled  by  such  a  framewjrk.  Among  them 
are  resource  allocation,  organization  theory,  pricing 
problems,  and  military  conraand,  control  and 
ccmmunlction  (CJ)  systems. 

As  an  example,  consider  the  problem  of  platform 
positioning  in  a  naval  battle  group.  Ships  in  a  naval 
battle  group  have  various  levels  of  defensive 
capability  against  emtny’s  «ir,  surface  and  subsurface 
threats,  lhe  overall  defensive  effectiveness  of  the 
battle  group  depends  not  only  on  the  capabilities  of 
its  member  ships,  but  also  on  the  positioning  of  all 
the  ships  <CAS82>.  In  one  of  the  doctrines,  the  ships 
are  partitioned  into  subgroups  for  different  defensive 
functions,  such  as  antlalr  and  antisubmarine.  Each 
subgroup  has  a  function  commander  who  is  required  to 
position  his  ships  to  maximize  his  functional 
defensive  effectiveness.  The  overall  defensive 
effectiveness  is  then  determined  by  these  functional 
measures.  The  allocation  of  the  ships  to  subgroups  by 
the  top  commander,  the  coordination  and  competition  of 
the  function  commanders  to  maximize  thsir  own  or  the 
group  performance,  can  be  described  as  a  hierarchical 
decision  problem. 

As  mentioned,  the  leader's  major  task  in  such  a 
problem  is  the  design  of  his  strategy,  by  taking 
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into  account  the  follower's  reactions,  to  minimize  his 
cost.  Mathematically,  this  implies  the  presence  of 
composite  functions  in  the  leader's  optimization 
problem  (as  will  become  clear  in  subsection  2.2).  Any 
direct  solution  to  such  a  problem  seems  almost 
impossible,  except  for  special  cases.  Most  of  the 
recent  advances  are  due  to  the  discover ir’  of  tvo 
indirect  approaches,  the  team  soluble  approach 
«BAS79, 81  >,  <T0L81a>,  <H082>,  <ZH£82»,  and  the 
Inducible  Region  approach  i<CHA82i,82bl83>l 
<LHH83,S4>,  <TOL81b,83». 

In  this  paper,  we  use  the  inducible  region  concept 
to  study  a  class  of  Stackelberg  games,  where  there  is 
one  leader  and  a  continuum  of  followers.  Our  rurpose 
is  not  to  .model  aid  solve  a  realistic,  or  even 
simplified  Cs  problem.  Rather,  we  shall  adopt  a  very 
basic  model  to  study  hierarchical  decisionmaking,  and 
demonstrate  how  it  can  be  solved.  For  some  reasons, 
the  problem  is  formulated  in  a  pricing  context  in 
terms  of  companies  and  customers.  It  should  be  noted, 
however,  that  the  formulation  can  be  interpreted 
differently  to  model  related  problems. 

JL2  The  Pricing  Problem  and  Outline  of  the  Paper 
~  Consider  a  pricing  problem  in  which  a  company  sells 
a  certain  kind  of  product  to  its  customers.  The 
company  has  to  design  its  pricing  scheme,  and  make  it 
known  to  the  customers.  Knowing  the  pricing  scheme, 
each  customer  then  decides  the  Mount  to  purchase. 
Therefore  the  problem  can  be  viewed  as  a  Stackelberg 
game,  with  •■he  company  as  the  leader  and  customers  as 
followers.  One  way_  to  model  a  large  nuaber  of 
customers  is  by  means  of  a  histogram  based  on  a  key 
paraneter  such  as  customer' 3  valuation  of  the  product. 
After  normalization,  the  histrgram  can  be  thought  of 
as  a  probability  density  function.  With  the  customer 
population  being  so  described,  we  have  a  stochastic 
problem.  If  price  differential  is  not  allowed,  the 
company  can  not  observe  and  make  use  of  customers' 
private  information  (individual’s  valuation  of  the 
product).  We  therefo.  e  end  up  with  a  non-nested 
stochastic  Stackelberg  Game.  This  model  can  also  be 
interpreted  as  though  there  is  a  single  follower  who 
possesses  some  private  information  not  knom  to  the 
leader.  Problems  of  this  class  have  been  treated  to 
some  extent  by  economists,  e.g.,<WEI7|1>,<SPE77>. 
There  is  no  existing  results  in  the  control  or  game 
literature. 

In  Section  2,  we  study  a  single-stage  pricing 
problem.  The  model  is  formulated  in  subsection  2.1. 
In  subsection  2.2,  we  define  the  inducible  region  for 
such  a  stochastic  game.  We  then  In  subsection  2.3  go 
through  a  sequence  of  steps  to  delineate  the  inducible 
reg'on,  and  provide  a  systematic  way  to  solve  such  a 
problem. 

In  Section  3,  we  study  a  two-stage  pricing  problem 

where  a  company  sells  a  product  to  a  continuum  of 

customers  during  two  consecutive  time  periods.  The 

dynamic  information  Involved,  and  the 
substitution/  complement  effects  (to  be  defined  in 
subsection  3.2)  of  the  product  at  different  time 

periods  make  the  problem  more  complicated.  The 
formulation  is  given  in  subsection  3.1.  In  subsection 
3.2,  we  characterize  thr  customers'  reactions  to  a 
given  pricing  scheme. 


v.--. 
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The  Inducible  region  for  the  complementary  case  ia 
then  delineated  in  subsection  3,3,  where  we  also  show 
how  to  find  optimal  consumption  curves,  and  optimal 
incentive  pricing  schemes.  The  substitution  case  is 
investigated  in  subsection  3.4.  Concluding  remarks 
and  comparison  to  Spence's  work  are  given  in  Section 
4. 


2  Pricing  Problems 
Stsckelberg  Games 


Hon-nested  Stochasti  i 


2-2  Problem  Formulation 

Consider  a  pricing  problem  where  there  is  a  company 
(leader,  CMO)  that  sells  a  product  to  a  continuum  of 
customers  (followers).  The  customers  are  indexed  by  a 
par  meter  z,  indicating  individuals'  valuation  of 
goods.  As  sane  that  ze  Z  a  [z  ,  zj,  and  is  described 
by  a  probability  density  function  P2(z)  known  to  the 
leader.  A  customer  with  parameter  z  la  denoted  as 
DM1  .  Let  r_  denote  the  company's  pricing  scheme.  If 
DM  1*  purchases  u.  (>0)  units  of  the  product,  he  has  to 
pay  >‘q(u1)  dollars.  The  social  gain  can  be  represented 


•Ijtaj.z)  *  S(u1,z)  -  rQ(u1). 


(2. 1.1) 


The  function  S^-.z)  is  DM1  's  satisfaction  in  dolla* 
value  by  purchasing  u.  units  of  the  product.  We 
assume  that  S  is  twice  differentiable  with  S  >0, 
so  that  a  customer  with  higher  valuation  of  goo&s  has 
higher  marginal  satisfaction  at  every  level  of  u.. 
The  function  r  (u.)  is  assumed  to  be  twice 

differentiable  for  u,  >0.  For  a  given  r_,  DM1 
decides  an  optimal  u1  to  “maximize  l.e.,  0  * 


max  (  S(u  ,z) 
U1 


r0(u1)  5 


subject  to 


U1  2  °* 


(2.1.2) 


The  solution  of  (2.1.2)  is  denoted  as  u,°(z).  It 
represents  CHI  ‘a  "reaction"  to  the  given'  r..  The 
price  he  has  \o  pay  for  it  is  u.=r  (u.  (zT).  For 
simplicity,  we  assume  that  u.  (zT  exists  and  is 
unique.  If  u^  (z)  la  not  unique,  CMO  has  to  assume 
the  worst  case  in  his  strategy  designing  stage.  This 
complicates  the  derivation,  yet  adds  very  little 
understanding  to  the  problem. 

The  company's  payoff  function  is  described  by 


J0  *j  l^Lo(rO<ul)'u1<z),t)PZ<l>,dz’ 


*  E  (  L0  ). 


(2.  1.3) 


Jq  can  be  the  profit,  a  social  welfare  function,  or 
any  meaningful  function  flrom  the  company's  viewpoint. 
For  the  moment,  however,  we  shall  not  specify  I.- 
expllcitly.  Knowing  the  density  function  P-(z)  and 
followers'  rationale  in  reaction  ((2.1.2)),  WO  wants 
to  select  a  strategy  rQ  from  some  admissible  set  T.  to 
maximize  4  .  In  other  words,  CMO  solves  the  following 
problem: 

(P-1)  max  E  (L0(r0(u1),u1(z),z))  subject  to  (2.1.2). 


r0  ef0 


(2.1.4) 


The  decision  sequence  is  summarized  as  follows. 

Wz)  •  :o(<)  _ '  wv _ 

•  u,°(z) 

DM0'  Prior  :  ,  ' 

Information  j  Design  Stage  |  Execution  Stage 
Fig.  2.1 

fro:.*lea  (P-1)  is  a  single-stage,  stochastic 


Stsckelberg  game.  The  information  structure  is  non¬ 
nested,  since  each  follower  knows  his  own  parmaeter  z, 
but  the  leader  can  not  observe  and  make  use  of  it. 
The  model,  depending  on  the  functional  form  of  L.,  can 
also  be  Interpreted  as  though  there  is  a  single 
follower  who  possesses  as  me  private  information  z  not 
known  to  the  leader. 

2.2  Inducible  Region  for  Stochastic  Games 
"  "Problem  (P-1)  la  generally  an  Intractable  probloa 
due  to  the  presence  of  the  composite  function 
r . (u ^ z) ) ,  and  the  non-nested  information  structure. 
In  this  section,  we  shall  define  the  Inducible  Region 
for  this  stochastic  problem,  and  use  it  to  solve  the 
pricing  problem. 

For  a  given  r.,  each  follower  solves  (2.1.2)  to 
obtain  Uj°(z).  The  mapping  from  2  to  U.  generated  by 
u,(z)  is  denoted  as  r..  We  then  say  tnat  the  r.  is 
Induced  by  this  r.,  ana  (r.,^)  is  an  inducible  pair 
of  strategies.  For  a  different  DHO's  strategy,  the 
followers'  reactions  will  be  changed  accordingly,  and 
we  have  another  inducible  pair  of  strategies.  The 
Inducible  Region  in  the  Strategy  Space.  IBS,  is  t'uen 
defined  as  the  collection  of  all  the  inducible  pairs 
of  strategies,  l.e., 

IBS  *  Hr-.r^):  (rQ,r1 )  is  an  inducible  pair  of 
strategies) . 

(2.2.1) 

Since  any  realizable  strategy  pair  must  belong  to  IRS, 
(P-1)  is  equivalent  to  the  following  problem: 


(r0,r,)eiRS 


E  (  Lo(ro«ri»*) 


(2.2.2) 


In  (P-2),  there  is  no  composite  function.  However, 
IRS  is  a  set  of  pairs  of  functions  and  is  herd  to  be 
delineated.  Furthermore,  even  if  IRS  can  be 
cSllneated,  it  is  still  not  easy  to  perform  the 
maximization,  since  it  Involves  rQ  and  r.  at  the  same 
time.  Motivated  by  results °  of  deterministic 
Stsckelberg  games,  we  shall  next  find  IRS's 
counterpart  in  the  decision  space. 

Consider  a  pair  (r.,r1 )  e  IRS.  DM1  's  decision  is 
given  by  u.  s  r,(z).  The  corresponding  u.  is 
determined  by  rQ(z)  s  r.lr.d))  j  fQ(z),  where  rD(z) 
is  defined  as  the  composite  function  of  rQ  and  r,  Chat 
maps  Z  into  UQ.  The  pair  (u.sf.(z) ,u .sr. (z) )  can  be 
regarded  as  the  outcome  for  DRlufor  this  rQ.  For  a 
follower  with  parameter  z' ,  the  corresponding  outcome 
is  given  by  (fQ(z’ ),r1  (z’ )).  Thus  across  the 
population,  the  outcomes  are  described  by  the  pair  of 
functions  (f-.r^).  We  then  define  IBP.  Inducible 
region  in  theu decision  space,  as  “ 

3D  *  f(f0,r.):  3  (rn,r.)e  IRS  s.t.  1 

f„(!)  !  r.(r.(:))  »:cll.  1 

(2.2.3) 

It  is  dear  that  (P-2)  is  equivalent  to  the  following 
optimization  problem:  ~ 


(P-3)  max  E  l  L^f-.r-.z)  ), 

f  #  —  %  -TBit  W  W  1 


(f0,r,)  eIRD 


(2.2.4) 


vdiere  appropriate  substitutions  of  r«  by  f.  in  L.  are 
assumed.  For  deterministic  problems,  IRD  is  a  subset 
in  the  decision  space(<CP*82b) ,<LUH83, 84>).  For  our 
problem,  however,  due  to  the  presence  of  a  continuum 
of  customers,  IRD  remains  to  be  a  set  of  pairs  of 
functions. 

2.3  The  Inducible  Re-:. on  Approach  2.3.J,  Customers' 
First  and  Second  Order  Necessary  Conditions 

In  order  to  delineate  IRD  and  solve  the  problem,  we 
shsll  first  examine  a  customer's  reaction  to  a  pricing 


/  \ 


/-  u 


.■  v 


id  /  / 


erne.  For  a  given  rQ,  a  customer  has  to  decide 
thar  to  buy  the  product  or  not.  Assume  r.{0)  =  0, 
„  a  customer  pays  nothing  if  he  does  not  buy  it. 


(0.x)  >  (S(u.,z)  -  rn(u.))  W.u.  >0, 

1  01  1  (2.3. 1.1) 

customer  will  not  purchase  any  of  the  product, 
.,  u.  =  0.  Let  z.  be  the  value  of  z  such  that  u^  = 
for  z  <  z  .  thus,  z  can  be  thought  of  as  the 
i try*  point.? 

Consider  a  customer  with  z  >  z^.  The  first  order 
'essary  condition  of  (2. 1.2)  is 


dJ^/du^  *  0,  or 
su,  *  dro/dur 


<2. 3. 1.2) 


i  solution  will  lead  to  the  function  r^z).  the 
x>nd  order  necessary  condition  is 

d^./du.2  a  (S  -  d2rn/du  2)  <  0. 

1  1  u1u1  0  1  <2.3. 1.3) 

To  get  more  explicit  result,  we  take  total 
‘lvative  on  both  sides  of  (2.3. 1.2)  with  respect  to 
He  have 

(Su  u  )  dr^/dz  ♦  3U  x  *  (d^g/du^)  dr^/dz,  or 
l  S  -  (d2rn/du  2)  }  dr-/d2  a  -Slt  (2.3.1.**) 

U|U?  LI  I  UJZ 

nee  S  .  >  0  by  assunptlon,  the  right  hand  side  of 

.3. 1.«zls  not  zero,  ttierefore,  neither  (  S  -  .  - 
rQ/du,  )  nor  dr./dz  la  zero.  Equation  (2.3.  w3) 
’  en  implies  that 

d2J,/dn.2  *  {-S„  /(dr./dz) )  <  0,  and 

11  u1*  1  (2.3. 1.5) 


dr(/dz  >  0. 
thus  have 


(2.3. 1.5) 

(2.3. 1.6) 


asms  2.U  the  customers'  reactions  r.(z)  is  a 
rictly  increasing  function  of  z  for  z  >  z,  (l.e.,  a 
stoaicr  with  higher  valuation  bu/s  more  goods), 
so,  (2.3. 1.2)  and  (2.3. 1.5)  constitute  a  set  of 
uifialent  conditions  for  customers  with  z  >  z^. 

Since  r.,(z)  is  strictly  increasing  fo^z  >  zd>  its 
iversc  mapping  exists,  denoted  as  zsxj”  (uf).  thus 
though  the  company  can  not  observe  directly  a 
stomer's  private  information  z,  however,  by  knowing 
.s  decision  u1t  the  company  can  indirectly  figure  it 
it.  this  point,  as  a  result  of  the  assunptlon  that 
>  0,  turns  out  to  be  crucial  in  our  design  of  the 
npeny's  pricing  scheme, 

•2-2  Construction  of  the  Composite  Function  fg 

Define  the  Set  of  Reactions,  SR,  as  the  collection 
f  all  inducible  reactions,  l.e., 

1 

SR  *  {  r,:3r0  s.t.  (rg,^)  eIRS  },  or 


*  (  r,s  JfQ  s.t.  (fg,r ^ )  eIRD  ). 


a  have  the  following  result. 


(2. 3. 2.1) 


heorem  2*2*  For  any  r.  eSR,  its  corresponding  f-  can 
e  constructed  within  the  limit  of  a  constant  term, 
nd  is  given  by 

r„(z)  *  S(r.(z),z)  -  |  <33/3z)dz'  -  C., 

0  1  Jz„  1  (2. 3. 2. 2) 


»  C  f  s’  (r.  ’(u  '),u')du  •  ♦  CJ 

J  0U1  1  1  1  1  ■  lu.  =  r.(z), 

1  (2.13.2.3) 

where  C.  and  C_  are  constants  to  be  determined. 
Furthermore,  one  rQ  that  Induces  the  r^  is 

r0(u.)  *  (sj  (r  “1(u'),u  .»)  du'  ♦  C,. 

0  1  J  0U 1  1111  Z  (2.3.2.D 

Due  to  limit  on  page  size,  proofs  for  theorem  2.1  and 
remaining  theorems  are  not  Included  here.  For  their 
proofs,  please  see  <NIN83>. 

2.3.3  Solution  Method 

Theorem  2.1  is  very  useful  since  it  says  that  for 
any  r.eSR,  its  corresponding  f.,  as  well  as  rQ,  can 
be  constructed  within  the  limit  of  a  constant,  thus 
the  optimization  of  J.  in  (P-3)  wl.h  respect  to  (f-, 
r.)  over  the  set  IRD  can  essentially  be  reduced  to  the 
optimization  of  J.  with  respect  to  r.  over  the  set  SR. 
The  constant  term  in  f.  is  related  to  z.,  and  can 
generally  be  resolved  separately.  He  tnua  have  a 
reduced  version  of  (P-3)  as  follows: 


max  E  (  L.(r.,z)  ), 
r,eSR 


(2. 3. 3.1) 


where  appropriate  substitutions  of  f„  by  (2. 3.2. 7)  or 
(2. 3.2. 9)  are  assumed.  Note  that  (P-4)  is  a 
maximization  of  J.  with  respect  to  a  single  function 
r.  over  the  set  SR,  rather  than  a  maximization  with 
respect  to  two  functions  as  in  (P-3) .  The  next  theorem 
then  delineates  SR. 

Theorem  2.2:  ’he  Set  of  Reaction  SR  is  given  by 

SR  a  (  r.:  dr./dz  >  0  for  z  >  z.,  and  r.(z)  =  0 
for  z  <  zd,  zde  [z  ,  z"  ]). 

d  d  m  "  (2.3. 3.2) 


Problem  (P-4)  can  be  rewritten  *s  follows: 

(P-4')  max  E  {LQ(r.,z))  s.t.  dr./dz  >  0  for  z>zd  and 
r i  r,(z)a0  for  z<zd>  VlVzM]* 

<2. 3. 3.3) 

It  is  an  "optimal  control  like"  problem  with  an 
inequality  constraint.  Although  its  solution  is 
nontrivial  and  subject  to  further  investigation, 
nevertheless,  we  have  converted  the  original 
intractable  problem  (P-1)  to  the  comparatively  much 
simpler  problem  (P-4),  where  its  solution,  or  8t  least 
e  -solution,  is  promising.  Chce  (P-4)  is  solved, 
its  result  r.  is  then  the  best  inducible  r..  the 
optimal  pricing  strategy  can  then  be  constructed 
according  to  (2.3. 2. 8).  For  completeness,  the  entire 
lnduclbl'i  region  IRD  is  presented  as  follows. 

Theorem  2.3?  IRD  is  delineated  by 

IRD  a  (  (f.,r  ):  r.e  SR,  and  f-  is  given  by  (2. 3.2. 7) 
or  (2.  3.  2.9)  ). 

1  Two-Stage  Pricing  Problems 

3. J,  Problem  Formulation 

in  this  section,  we  extend  the  model  and  the 
Inducible-Region  approach  of  Section  2  to  study  a  two- 
stage  pricing  problem,  where  a  company  sells  a  product 
to  a  continuum  of  customers  at  two  consecutive  time 
periods.  In  many  situations,  once  a  customer  decides 
to  enter  a  market,  he  has  to  pay  an  entrance  fee 
regardless  his  consumption  level.  For  instance,  a 
telephone  company  charges  a  basic  service  fee  to  a 
customer  in  the  monthly  bill  even  though  he  did  not 
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use  the  telephone  for  the  entire  month.  We  shall 
therefore  iua« 

(  rQ1  ♦  r.,  ♦  A  for  committed  customer, 

r0  * 

l  0  otherwise, 

(3. 1.1) 

where  rfl.  denote  the  company's  pricing  strategy  at 
stage  1,  and  A  the  entrance  fee.  let  u„(z)  denote 
the  mkount  of  the  product  purchased  by  Writ  stage  1 
(1x1,2).  Alos,  let  y.  denote  the  on-line2 uncertainty 
at  stage  1  such  as  father  that  affect  customers' 
aatisfactlon  aa  well  as  the  c  japan  y's  payoff.  It  Is 
described  by  the  probability  density  functions 
PTi(y.).  For  simplicity,  we  assists  that  y.,  y- and  z 
are  mutually  Independent.  Finally,  similar  to  section 
2,  we  let  r..  denote  the  consuaption  curve  at  stage  1. 
The  sequenerof  actions  is  summa*lzed  In  Fig.  3.1.  In 
this  figure,  g  .  and  g^  denote,  respectively,  the 
Information  available  to  the  company  and  EMI  at  stage 
1,  and  are  given  by 

gz1  *  J2*  y1}  ^01»  r02)» 

*pi :  yivJn)*  _  . 


li,  J,«  «,,«  Jji 

(y,»  y2.  u12). 


<3. 1.2) 


y  m  m  r _ _ 

r01,*r02  !  y1  u01,ar01<gp1)  !  ,yr  u02*r02(gp2> 


|Un*rii(gzi)  ! 

•  11  11  21  Fig.  3.1 


u12ar12(gz2) 


We  let  S(z,y1,y_,u  ,u,_)  represent  EMI  's 

satisfaction  In  dollar  value,  and  the  social  galnzis 
given  by 

4,  -  S(i.y1.y2,u11,Ui2)-rai(y,,Uii)-r0,(y,,y2.u^)-A.) 

We  assuaa  that  r-.  tod  S  are  twice  differentiable  for 
u  >  C,  with  ‘S  „  >  0  and  S  >  0,  i.e.,  a 
customer  with  larger  z  has  higher  marginal 
satisfaction  for  every  level  cf  purchase  at  both 
stages.  We  also  assume  fur  simplicity  that  r..  is 
continuous  at  u.,  *  0  with  value  zero  If  u.-eO.  1 
For  a  given11  pricing  scheme  (rQ1,  rA  and  A), 
customers  decide  their  optimal  consumption  strategies 
by  maximizing  J^,  i.e., 

max  E  {  4  }. 

r,,,  f12  (3.1.*) 

Let  be  the  payoff  of  EMI  if  he  does  not  enter  the 
market,  I.e., 


4  I  '  E  (  3(z,y.,y_,  u..*0,  u..sO)  }• 

_  (3.1.5) 

Also  let  r .  be  the  payoff  of  EMI  If  he  enters  the 
market,  I.e.,  2 


x  i  (S(z,y  i*y2,u  11,U  i2^”roi  ^y  1,u  1 1  ^"r02^y  1*y 2*u  ' 


with  uf]  >  0  and  u12  >  0. 


(3.1.6) 


EMI  will  enter  the  market  If  P .  >  J°.  for  some  un  > 
0  mid  u12  >  0.  Let  z.  denote  the  "entry  point",  i.e .7 
a  customer  with  z  >°z  will  enter  the  mfrket.  For 
simplicity  of  discustlon,  we  shall  assune  In  this 
section  that  z^  >  z  .  Then  since  r.  *  J°.  at  z  s  z  , 
A  and  z^  are  related1  by  the  following  equation  : 

A  .E(S(zd,y1,y2,u11,u12)-  rQ,  (y1'u  1 1  >“*‘02<y l'y 2"u  12)} 
-  E  {  S(zd.y1.y2,0,0)  ).  (3.1.7) 


Knowing  the  customers'  ratlonale((3. 1.*)),  the 
company  then  designs  r..,  r.  ..  and  A  to  maximize  his 
payoff  JQ,  where 


VE1  E"0^r11,r12'r01'r02^  ** 


(3.1. 8) 


That  is,  the  company  solves  the  following  optimization 
problem: 


max  l  40  ) 
r01*  r02' 


subject  to  (3. 1.*). 


The  problem  Is  thus  s  two-stage,  stochastic 
Staokelberg  game.  Similar  to  the  single-stage  pricing 
probla  of  Section  2,  the  information  structure  Is 
non-nested  since  the  company  cai  not  observe  :  id  make 
use  of  the  Information  z.  We  shall  ne.:t  solve  tills 
problem  by  extending  the  Inducible  region  approach  of 
Section  2. 

2«2  Reactions  of  Customer  s 

“For  simplicity,  we  shall  assuae  that  once  a 
customer  decides  to  enter  the  market,  he  will  purchase 
the  product  at  both  periods  (I.e.,  u^  >  0  and  -i..  > 
0).  These  customers  solve  a  two-stage  dynallc 
optimization  problem,  I.e., 


max  E  W1(r11.r12.r01,r02)  |gl2),  and 
r12 

max  E  CJ1<r,|.r12».r0|,r02)  |  sr1>. 


(3.2.D 


(3.5.2) 


where  r..*  denotes  the  solution  of  (3.2.1).  For 
simplicity,  we  assune  that  both  (3.2.1)  and  (3.2.2) 
have  unique  solutions.  To  better  characterize 
customers'  reactions  for  a  given  pricing  scheme,  we 
shall  first  examine  neoessary  conditions  of  (3.2.1) 
and  (3.2.2).  The  first  and  second  order  necessary 
conditions  of  (3.2.0  are  as  follows: 


d4(/du12  a  0,  and 
d24t/du122  <  0 


for  u12  >0. 


■  3r.,/3u 


02,OU12  * 

-  32r02/3u12*>  <  0  for  u12  >  0. 


(3.2.3) 

(3.2.*) 


•  Ut^J12  02-  -12  - - -12  '  -  ’ 

Note  that  there  Is  no  conditional  expectation  In  the 
above  equations  since  at  the  second  period,  g  - 
Includes  everything.  Similarly,  the  necessary 
conditions  for  (3.2.2)  are 


d(Ey  {  J,  JJ/du,,  •  0,  and  (3.2.5) 

d2(Ey  (JjO/du^i  0  for  ut1  >  0.  (3.2.6) 

The  expectations  are  taken  over  y2,  I.e.,  the  on-line 
uncertainty  at  stage  2.  From  (3.2.5)  mid  (3.2.3),  we 

have 

Ey2lSu11-3r01^u11  *(  Su12-3r02/Su12)dr12/du11J 


'  y2  *  'u11  *3r°l/3U11  1  *  °*  . 

Therefore,  (3.2.5)  becomes 

1  5  )*  3r-./3u...  (3.2.8) 

’2  11 

Similarly,  we  can  expand  (3.2.6)  tu  obtain 
Ey,  tSu  u  u  <^^12/^u11)-^^^o1/^u11^,  i  °* 

y2  u  1 1u  1 1  u  11U 12  1Z  11  01  11  T3.2.9) 

We  have 


(3.2.7) 


1  *  .  •  •••.».  ■ 
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result 


■a  3.1:  If  (S 


(dr, ,/dz) )  >  0,  then 


BS  i  _•  “  '  in  12  ii  ll'l  12'  11  '  * 

j ,/di  >  0  for  j  /Y,  r.e.,  r12(z)  Is  a  strictly 
:r  easing  function  of  z. 


:r  easing  function 
•ilarly,  we  have 


■=•  1-2*  ^  Ex2  «S  *  Su.Ui12(3r12/3,z>>  >  °. 

an  dr^/dz  >  r  for  :  >  z“,  1  .e r .  1  (z)  Is  a 
rletly  Increasing  function  ofaz.  ’ 


Theorem  3_.ji:  For  the  case  where  S  «.  «_  >  0,  the  Set 
of  Reactions  SR  Is  given  by  vl  ‘ 

SRs{(r, ,,r ,,):&r, ,/dz  >  0  and  dr, ,/dz  >  0  for  z  >  z., 
1  'and  rn(z)  s  rl2(z)'I  0  for  z  <  zd,  i 
C  [ 2_  ,Z.J  }  • 

m  W  (3.3.3) 


Lemma  3.1  and  Lemma  3.2  are  useful  since  If  they 
Id,  then  both  r^  and  r12  are  strictly  Increasing 
nctl^ns  of  z,  and  'theltj  Inverse  mappings 

rt,  (u,,;v,)  »d  *sri2^ <u12sy  l,y2J  ex  lit . 
nnequently,  knowing  a  customer's  decision  u^  or 
the  company  can  figure  out  his  parameter  z. 
k'llar  to  the  single-stage  case,  this  point  is 
uc'al  ii  designing  the  caapary's  pricing  schemes. 

fu-ly  u.ilize  Lemma  3.1  ar.d  Lemma  3.2,  we  have  to 
■sine  the  conditions  under  which  they  are  satisfied. 

.  shall  first  characterize  the  term  3r.,/3z  In  Lemma 

2. 

wa  3.3:  ’f  S  >0,  then  3r,0/3z  >  0  for  z  >  z.. 
- —  *  zu  12  12  d 

Note  that  the  term  S  appears  in  both  Lemma 

1  and  Lemma  3.  •<  He  'ihall  next  consider  the  case 


Now  similar  to  Section  2,  the  original  Intractable 
problem  (P-i)  la  equivalent  to 


uttu12 


(3.2.19) 


.e.,  a  customer's  marginal  satisfaction  at  one  stage 
l  enhanced  as  the  purchased  mount  at  the  other  stage 
i  increased.  In  other  words,  the  product  at 
Ifferent  time  periods  have  complement  effect.  The 
ther  situation  when  the  substitution  effect  exists, 
.e.,  S  <  0,  will  be  discussed  In  Section  3.4. 

;  cm  Ve,u easily  checked  that  if  S  ..  >  0,  then 

Ince  S  ..  >  0  and  £_  12  >  0  byu  assumption,  the 
ondiiiollb  of  Lemma  3.1  and  Lemma  3.2  hold.  We 
here  fore  have  the  following  result. 

I 

heorem  J.J,:  For  the  two-stage  pricing  •.  problem  as 
insulated.  if  S  >  0,  then  r..  and  r.,  are 

trietly  increasing  nmctlons  of  r  for  z  >  iz. . 

i 

•2  Inducible  Region  and  Stackelberg  Strategies 
In  this  subsection,  we  shall  first  show  by 
onstructive  proof  that  for  any  pair  (r^Jf  r.,)  of 
trlctly  Increasing  functions  of  z,  there  exllts  a 
air  of  Incentive  pricing  schemes  to  lndhce  It.  We 
hen  delineate  the  entire  Inducible  region  for  tfcs 
ase  when  S  >  0. 


(P-2)  max  E!L0'!fl11r,2>  ). 


(3.3.4) 


where  appropriate  substitutions  of  r..  and  r_,  by 
(3.3.1)  aid  (3.3.2)  are  assumed.  Furthermore,  z. , 
rather  than  A,  is  treated  as  an  Independent  parameter 
to  be  optimized.  Consequently,  the  remaining  problem 
can  be  solved  by  going  through  the  following  steps: 

(1)  find  necessary  conditions  for  r12  and  r?1  by 
solving  an  optimal  control  like  problwi  at1  each 
period, 

(2)  find  necessary  conditions  for  z  , 

(3)  solve  conditions  derived  from  (1)  and  (2) 
simultaneously  or  sequentially,  and 

(4)  construct  the  optimal  Incentive  pricing  schemes 
according  to  (3.3.1),  (3.3.2)  and  (2,1.7). 

Finally,  for  completeness,  we  hav* 

Theorem  3.4:  For  goods  with  complement  effect,  IRS  is 
delineated  by 

IS?  *  l(*,roi'r02'r  1 1'1' ijj)*  (h^ ,,f ,j)6SP.  A*  rni»  rQ2 
are  given  by  (T.1.7),  (3.51)  and  (3-3.2?f 


respectively) . 


(3.3.5) 


J.4  The  Case  of  Substitutive  Goods 

In  this  section,  we  shall  discuss  the  case  tfiere 
the  product  at  different  time  periods  have 
substitution  effect.  It  can  be  checked  that  now  the 
conditions  of  Lemma  3.1  and  3.2  might  not  be 
satisfied.  Therefore,  the  results  of  the  previous 
section  do  not  hold  in  general.  We  shall  Impose 
appropriate  conditions  on  both  the  satisfaction 
function  S  and  the  pricing  strategies  (rQ1.  r_,),  so 
that  those  conditions  remain  valid.  01  02 

Consider  an  S  with 


heorem  3.2:  For  any  pair  of  functions  (r..,  r._) 
1th  dr.j,/dz  >  0  and  dri2/dz>  0  for  z>  z.,  there  exist 
i  pair  of  Incentive  pricing  schemes  (r^,  rg2)e  ^  qi  * 
,,  to  Induce  them.  They  are  given  by 


01 

‘^2tSu11tr1l“1«y  1*y2*u1l’'r12<r1l“1*y1'v2*u11*>,duir 

(3.3.1) 


-q3  <  0, 


s  -du  <  0.  and  S 


u  12^12 


*  -q«-  <  o. 


(3.4.1) 

(3.4.2) 

(1.4.3) 


The  negativity  of  S  Implies  tliat  the  product  at 

different  time  periods  2  are  substitutes.  Equation 
(3.4.3)  says  that,  at  each  stage,  a  customer's 
marginal  satisfaction  decreases  as  the  amrunt  of 
purchase  increases.  We  then  have 


r02  ’(Su,2(r12_1'yi.y2,  r11<ri2’1.y1>.u12,>  du12'. 


(3.3.2) 


The  required  entrance  fee  A  Is  given  by  (3.1.7). 

Similar  to  the  single-stage  case,  we  can  define 
IRS,  the  IndualblJ  Region  in  the  Strategy  Space,  and 
SR,  the  Set  of  Reactions.  "He  THen  have  the  following 


Theorem  3.5:  For  an  S  satisfying  (3.4.1)  to  (3.4.3), 
suppose  that 


q5  >  (q2/q,)  q3,  and 
<iii  >  (<i1/<i2)  q3- 


(3.4.4) 

(3.4.5) 


Then  with  affine  pricing  schemes  (r.,  being  affine  In 
Uj.),  the  custouiers'  reactions  are  strictly  increasing 
functions  of  z. 


\ 

\ 


Intitutively,  (3.4.4)  and  (3-4.5)  imply  that  the 
sitostitutlon  effect  is  weak-  To  9ee  this,  we  first 
assune  that  q.  Z  q-.  In  this  case,  (3-3.-)  and 
(3.4.5)  become  q,  <  q-  and  q,  <  q^.  That  Is,  the 
substitution  effect  (q,1  is  weaker  than  tne  decreasing 
in  marginal  jatisfeetron  within  each  period  (recall 
that  —  3  S/3u..  ,  and  hi,  —  3  S/3u,.  )•  If  q. 
q  ,  (3-3-5)  and  (3-4.5)  then5imply  that  q.  should  be 
weaker  than  some  scaled  version  of  q^  and  qi. 

Under  the  conditions  of  Theorem  3-5,  r^j  and  r^« 
are  strictly  increasing  functions  of  z.  Although 
(3.3.1)  and  (3.3.2)  can  be  applied  to  obtain  certain 
pricing  functions,  however,  these  functions  might  not 
be  affire,  thus  the  original  assumption  might  be 
violated,  and  there  might  not  exist  a  pair  of  affine 
pricing  schemes  that  induces  them.  Therefore  the 
result  we  have  here  is  not  as  strong  as  those  in  the 
complanentary  case. 


4  Concluding  Remarks 

-  Pricing  problems  are  formulated  as  stochastic 
Stackelberg  gane3  with  non-nested  information 
structure,  and  studied  by  using  the  Inducible  Region 
concept.  It  was  thought  that  such  problems  were 
extremely  difficult  to  solve,  and  their  inducible 
regions  are  not  clearly  defined.  However,  by 
exploiting  the  special  structure  of  the  model  under 
consideration,  we  are  dole  to  delineate  the  complete 
Inducible  region  for  the  single-stage  pricing  problem. 
Die  optimal  inducible  consumption  curve  can  then  be 
obtained  by  solving  an  optimal  control  like  problem. 
The  corresponding  optimal  pricing  scheme  can  also  be 
constructed . 

Spence  studied  single-stage  pricing  probloas  In 
<S PE 77 >.  There  is  certain  mathematical  resemblance 
between  his  results  and  the  results  presented  in 
Section  2.  Some  of  our  ideas  are  indeed  motivated  by 
his  work.  In  <SPE7Y>,  he  first  substitutes  out  all 
the  r  's  by  (2.3. 2. 2),  and  then  maximizes  J.  over  the 
unrestricted  set  of  all  the  r^  's.  Ch  the  other  hand, 
in  the  inducible  region  approach,  we  first  substitute 
out  r0's  by  (2. 3.2.2)  or  (2. 3.2.3)  (they  are  shown  to 
b>  equal).  Instead  of  directly  maximizing  J„  over  the 
unrestricted  set  of  all  the  r^'s,  we  delineate  the  set 
of  ail  the  customers'  reactions  (SR).  We  then  find 
the  optimal  inducible  reaction  by  maximizing  J„  over 
SR.  Once  r  3  is  found,  r.  can  then  be  constructed 
from  (2,3. 2. 4).  Ihus  using  the  inducible  region 
approach,  we  guarantee  the  solution  obtained  from 
(P-4)  in  inducible,  however,  this  is  not  generally 
true  by  using  fence's  method.  For  problems  treated 
in  <SPE77>,  his  solutions  happened  to  be  in  SR,  thus 
he  was  able  to  get  the  results.  For  general  problems, 
this  will  rot  be  the  case.  We  then  have  ’m  rely  on 
the  inducible  region  concept. 

For  the  two-stage  pricing  problem  of  Section  3,  the 
dynanic  information  Involved  makes  it  more  complicated 
than  the  single-stage  problem,  and  the 
complement  /substitution  effect  of  the  product  at  two 
different  periods  begins  to  energe.  If  the  product  at 
different  periods  have  complement  effect,  we  delineate 
the  entire  Inducible  Region.  The  desired  consumption 
curves  can  be  found  by  solving  an  optimal  control  like 
problem  at  each  period,  and  the  optimal  pricing 
strategies  can  then  be  constructed.  On  the  other 
hand,  if  the  product  has  "weak"  substitution  effect, 
we  show  that  the  optimal  affine  pricing  schemes  can 
sometimes  be  constructed. 
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.. _  ABSTRACT 

The  problem  to  be  investigated  consists  of  determining 
the  optimal  locations  of  files  and  the  number  of  redundant 
copies  of  these  files,  in  a  vulnerable  communication^ 
network.  It  is  assumed  that  each  node  and  link  of 
the  communication  notw  >rk  can  fail  independently  of 
the  others.  The  optimization  problem  maximizes  the 
probabi'ity  that  a  commander  is  able  to  access  the  subset 
of  the  files  that  he  needs  while  minimizing  the  network¬ 
wide  costs.  These  network-wide  costs  are  storage  costs  and 
costs  due  to  the  time  delay  in  query  and  update  requests 
of  the  distributed  data  base  systepj.  .  ,  ^ 

The  problem  can  be  shown  to  reduce  to  a  zero-one  linear 
programming  problem.  We~wrtl  look  for  theorems  which 
reduce  the  complexity  of  the  solution  of  the  zero-one  linear 
programs.  Finally  a  heuristic  algorithm  lias  be  developed  to 
solve  the  zero-one  linear  program.  An  efficient  polynomial 
time  algorithm  has  been  developed  for  the  totally  reliable 
network  case.  WeAvUI  try  to  extend  the  efficient  polynomial 
time  algorithm  jlo  the  unreliable  network  case. 


1.  INTRODUCTION 


1.1.  Literature  Survey  of  Work  Done  on  Unreliable 
Network* 

Unreliable  networks  are  networks  in  which  the  nodes  or 
links  may  not  be  operational  at  any  time  due  to  failures. 
The  first  researchers  to  investigate  the  file  allocation 
problem  in  an  unreliable  networks  were  Mahmoud  and 
Riordon  (1.1].  Mahmoud  and  Riordon  developed  a  model 
in  which  not  only  the  allocation  or  files  were  optimized, 
but  also  the  optimal  capacities  of  communication  channels 
were  determined.  The  interesting  extension  with  respect 

to  previous  models  was  the  definition  of  availability  con¬ 
straints  measuring  the  probability  of  having  at  least  one 
copy  of  the  file  accessible  as  a  function  of  link  reliabilities, 
node  reliabilities  and  network  topology.  Mahmoud  and 
Riordon  also  used  network  delay  constraints.  They  devel¬ 
oped  a  model  in  which  not  only  the  allocation  of  files  were 
optimized  but  also  the  optimal  capacities  of  communica¬ 
tion  channels  were  determined.  The  delay  measured  used 
is  from  KIcinrock’s  [1.2]  book  ’’Stochastic  Message  (low 
«nd  Delay”.  The  delay  expression  assumes  fixed  routing, 
each  node  has  infinite  storage,  intermediate  nodes  incur  no 
processing  delay  as  a  message  travels  through  it  toward  its 


„  ^  P  °  t  >  r  l~- 

destination  and  the  assumption  that  each  time  a  message 
"tilers  a  node  a  new  length  is  chosen  from  the  exponential 
message  length  distribution.  The  use  of  fixed  routing  is 
wrnng  since  a  link  or  node  may  not  be  operational  at  the 
time  it  is  being  used. 

The  Mahmoud  and  Riordon  problem  reduces  to  a  nonlinear 
integer  programming  problem.  They  used  a  heuristic 
solution  technique  to  solve  the  problem.  The  solution 
algorithm  consisted  of  first  generating  a  number  of' feasible 
initial  solutions;  second  optimize  this  solution  by  successive 
add  itiguS  OF  utSctiOna  uf  uw  Cupits]  tuiiu  duupv  the 
lower  cost  allocations  and  repeat  step  two.  Finally  when 
additions  or  deletions  cannot  further  reduce  the  cost,  then 
this  allocation  is  considered  to  be  a  "local  optimum.” 
Each  time  the  algorithm  produces  a  local  optimum  it  is 
considered  as  an  "heuristic  run."  The  result  of  the  lowest 
cost  heuristic  run  is  adopted  as  the  optimal  file  allocation. 
The  heuristic  approach  is  assuming  that  one  or  more  of 
the  local  optimal  will  be  close  in  cost  to  the  actual  global 
optimum.  The  solution  must  satisfy  the  availability  and 
time  delay  constraints. 

Finally  file  allocation  in  a  distributed  computer  com¬ 
munication  network  with  adaptive  routing  has  been 
recently  published  by  Lanning  and  Leonard  [1.3].  They 
have  presented  an  algorithm  which  minimizes  storage  and 
message  transmission  costs  while  the  file  allocations  satisfy 
minimum  file  availability  and  message  delay  requirements. 
The  algorithm  uses  the  solution  to  a  p-median  problem 
to  find  an  initial  candidate  file  placement.  The  p-median 
problem  is  to  allocate  files  at  p  locations  so  as  to  minimize 
all  the  average  costs.  If  for  example  the  costs  were  replaced 
by  distance  then  the  simple  median  problem  would  be  to 
plarc  the  file  in  such  a  way  that  the  average  distance  from 
a  node  to  the  file  was  minimized.  For  a  p-median  problem 
one  would  try  to  place  the  files  in  such  a  way  that  the 
average  distance  from  a  node  to  a  copy  of  the  file  was 
minimized. 

Lanning  and  Leonard  used  the  p-median  problem  because 
he  requires  that  the  users  place  an  upperbound  on  the 
number  of  file  copies  say  p.  Lanning’s  algorithm  first 
solves  the  p  median  problem  for  each  p  up  to  the 
upperbound  P.  Second  the  algorithm  examines  each  of 
the  P  solutions  to  the  p-median  problems  and  determines 
if  the  locations  satisfy  the  availability  constraints.  If 
there  are  any  infeasibilities  the  algorithm  employs  a  set 
of  rules  to  eliminate  infeasibiiities.  Third  the  algorithm 
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examines  the  P  solutions  with  respect  to  delay  constraints. 

If  there  are  any  infeasibilities  the  algorithm  again  employs 
a  set  of  rules  to  eliminate  the  infeasibilitirs.  Once  all 
the  infeasibillties  have  been  resolved  the  best  of  the  P 
solutions  is  taken  as  the  optimal  solution,  banning  uses 
the  same  approximate  availability  measure  as  Mahmoud 
and  Riordon  for  the  availability  constraints  and  uses  a 
time  delay  approximation  for  the  time  delay  constraints. 

2.  TIME  DELAYS  IN  AN  UNRELIABLE  NETWORK 


2.1.  Basie  Problem 

The  problem  at  hand  is  a  completely  unreliable  system 
with  significant  time  delays.  Since  the  system  is  unreliable 
we  will  have  unaccessability  costs  and  because  the  system 
has  time  delays  we  will  also  have  communication  costs. 
Therefore  we  will  have  storage,  communication  and  non¬ 
accessibility  costs  in  our  objective  function.  We  want 
to  minimize  this  objective  function  subject  to  the  fixed 
capacity  constraints.  Therefore  our  formulation  for  the 
problem  at  hand  may  be  written  in  the  following  form: 


with  shorter  service  times  do  not  have  to  wait  as  long  as 
a  first  come  first  serve  (FCFS)  queue.  The  requests  with 
longer  service  times  have  a  slightly  longer  waiting  time 
than  FCFS,  but  it  is  more  than  counterbalanced  by  the 
reduced  waiting  times  for  the  requests  with  shorter  service 
limes.  Therefore  in  our  problem  formulation  queries  have 
higher  priorities  than  do  updates  since  queries  have  shorter 
service  times. 

We  also  may  have  priorities  due  to  other  factors.  These 
factors  include  importance  of  the  commander,  the  impor¬ 
tance  of  the  file  to  the  commander,  the  importance  of 
the  file  to  the  multiple  sensors.  The  waiting  times  due  to 
the  priorities  can  be  calculated  using  formulas  given  by 
Kleinrock’s  [2.1]  book  entitled  "Queueing  Systems  II”. 

2.3.  Fixed  Routing 

In  fixed  routing  the  routing  strategy  remains  fixed  once  it 
is  determined.  The  set  of  routing  variables,  <t\ ;(A)  define 
the  routing  that  is  to  be  done.  is  the  fraction  of 

traffic  emanating  out  of  node  i  that  is  destined  for  node  k 
on  link  (i,j).  these  set  of  routing  variables  can  be  found  by 
some  shortest  path  algorithm  subject  to  link  capacities. 


min  52  C(/i)  = 

'•  (=i 


min  52  12  12  Ti>ji=.]k  +  min  +  52  —  52  i)  —  52  52 

''  t=l  i=I  *€/i  €/‘  fc€/i  *=t  i=l;=l 

(2.1) 

s.t. 

x>0 

a,  —the  importance  of  commander  i 
lij,  =thc  value  of  file  j  to  commander  i 

P,i(li)  =thc  probability  that  file  I  is  accessible  to  the  commander 
at  node  i  given  an  assignment  /j 

Xji  =thc  volume  of  query  traffic  emanating  from  node  j  for  file  I 
Vjl  =the  volume  or  update  traffic  emanating  from  node  j  for  file  I 
H,*  =thc  time  delay  for  an  update  from  node  j  to  node  k 
fjk  =the  time  delay  for  a  query  from  node  j  to  node  k 
akj  =the  cost  of  locating  a  copy  of  a  file  j  at  the  k,h  node 
S  =the  number  of  sensors 

I'll]  —the  impui lance  of  file  I  when  data  from  sensors  i  and  j  are  fused 
PtJi(Ii)  =the  probability  that  file  I  is  accessible  sensors  i  and  j 
given  an  assignment  /,• 

/i  =the  set  of  node  indexes  representing  a  given  assignment  of  file  1 
2.2.  Queueing  Discipline 

We  require  the  most  efficient  queueing  discipline  to  reduce 
time  delays.  In  order  to  achieve  this  goal  we  must  allocate 
higher  priorities  to  the  requests  with  shorter  service  times 
and  lower  priorities  to  requests  with  longer  service  times. 

In  this  manner  the  average  waiting  time  per  request  is 
reduced.  This  is  true  because  the  request  with  longer 
service  time  must  always  wait  for  those  requests  with 
shorter  service  time,  while  the  requests  with  shorter  service 
times  do  not  wait  for  requests  with  longer  service  limes 
unless  they  arc  already  being  serviced.  In  this  way  requests 


Let  i,  be  the  generation  rates  of  messages  at  node  i,  X,.  be 
the  total  amount  of  query  traffic  emanating  out  of  node  i, 
and  t be  the  total  amount  of  update  traffic  emanating 
out  of  node  i.  We  then  have  t  he  following: 


<f,  =  Xt.  -f-  ti’i* 

(2.2) 

(2.3) 

*  l 
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Xj.  =  53  X.i 
1=1 


=  53  v>»i  (2.5) 

1-1 

Let  1/p  be  the  average  message  length.  Let  <)•,  be  the  total 
arrival  rate  of  messages  at  node  i.  In  other  words  0.,  is  the 
sum  of  messages  generated  at  node  i  and  those  in  transit 
through  node  i.  then  wc  have  the  following  relation: 

«-,  =  uEEW)  (2.6) 

«  i 

The  total  arrival  rate  of  messages  to  link  (i,j),  denoted  by 
9,,  is  given  by: 

«.,  =  *-.  £*„(*)  (2.7) 

k 


Therefore  if  we  can  determine  the  routing  variables  for 
an  unreliable  network,  then  wc  can  also  determine  the 
average  message  delay  for  an  unreliable  network. 


i#) 


Since  6"  corresponds  to  routing  variables  ♦Jy(fc),  it  must 
be  scaled  up  by  a  factor  of  because  the  link  is  only 
operational  a  fraction  of  the  time. 

2.5.  Time  Delay 

Let  Cij  be  the  capacity  of  link  (ij),  then  the  average 
message  delay  on  link  (i,j)  is  denoted  by  rt)  and  is  given 
by: 


2.4.  Variable  Routing 


suu  ivMuuig  iwi  miu  paiv  ui  wsiv  iwunuiauuii  niuao  uc 

changed.  In  the  previous  sections  we  have  used  fixed 
routing  strategies.  In  this  section  we  must  use  variable 
routing  strategies.  The  routing  must  be  done  in  such  a 
way  to  take  into  account  failing  links  or  changes  in  our 
topology  of  our  network.  Therefore  due  to  link  failures  the 
routing  variables  are  scaled  up  by  the  fraction  of  time  they 
are  operational.  This  is  simple  averaging  of  the  routing 
variables  with  respect  to  the  link  failure  probabilities. 

The  problem  is  now  how  to  change  this  routing  to  effect 
the  reliability  of  the  network.  Clearly  the  fixed  routing 
strategy  found  in  the  previous  paragraph  will  not  do  for 
our  unreliable  network.  Therefore  somehow  these  routing 
variables  must  be  changed  to  reflect  the  reliability.  We  can 
use  the  method  described  by  Li  [2.2].  Li  assumed  that  given 
a  routing  strategy  that  if  a  link  failed  that  the  remaining 
messages  will  flow  through  the  other  links  proportionally. 
In  this  case  we  can  then  find  new  routing  variables  based 
on  the  old  routing  variables  and  the  probability  of  link 
failures. 


_  _  .  g. 

where  irt]  is  the  probability  that  link  (i,j)  is  reliable  and 
G  is  the  set  of  all  links  in  the  network. 

This  approach  tc  redefining  the  routing  variables  assumes 
that  if  a  link  fails  the  remaining  links’  traffic  will  be  scaled 
up  by  the  fraction  of  their  operational  time. 

From  the  previous  chapter  on  time  delays  we  found  a 
recursive  equation  for  the  average  message  delay  from 
node  i  to  node  k  as: 

=  53  ^'iW[Tjk  +  tij]  (2.9) 

(tj)efi 


/iCy-et>  v 

The  average  delay,  T  for  messages  in  the  system  is: 

T  —  p  23  ®yr«>  (■ 


Where  E  is  the  set  of  all  links  in  the  network,  and  P  is 
the  total  message  generation  rate  given  by: 


The  average  message  delay  going  from  node  i  to  node  k  is 
denoted  by  7’,*,  and  is  given  by: 

r  .  =  I  T»  +  T,k  probability  4>l;(fc) 

'  l r,*  probability  ♦,*(*)  '  ‘  ' 

In  other  words  we  can  stale  a  recursive  equation  for  the 
average  message  delay  from  node  i  to  node  k  as: 

£  ♦o{*)Py*  +  T„l  (2.16) 

Mee 

equation  we  can  determine  the  average 
given  the  routing  variables  $!>(&). 

ion  of  Time  Delay- without  Priorities 

Let  be  the  total  delay  of  a  message  sent  to  node  k 
from  node  i  routed  over  link  (ij).  The  total  delay  is  due 
to  four  parts. 

1.  7i*=the  time  delay  from  node  :  to  node  k  routed 
over  some  set  of  paths 

2.  w,k=the  time  delay  due  to  queueing  or  the 
writing  time  at  node  k  before  being  processed 


/ 


X 
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3.  tj*=the  time  delay  due  to  the  service  time 

4.  T*i=the  time  delay  from  node  k  to  node  i  routed 
over  some  set  of  paths  for  confirmation. 

Then  the  following  equation  holds  for  total  message  delay: 


S ij(k)  —  To,  +  w,*  -+■  tik  +  Tin  (2-17) 
where  we  have  the  following  equalities: 


1) 


2) 


U,k  = 


1-- 


for  queries 
for  updates 


(2.18) 


for  queries 
for  updates 


(2.19) 


3)  The  values  for  Tl}  and  T},  are  set  in  the  recursive 
equation 

The  time  delay  has  been  calculated  as  the  time  delay  for  an 
M/G/l  queue  with  variable  service  rates.  In  actuality  this 
can  be  modeled  as  an  M/G/l  queue  with  priority  queueing 
and  variable  service  rates.  Using  a  good  queueing  discipline 
variable  service  rates  should  cause  priority  queueing.  In 
priority  queueing  the  messages  that  take  less  time  to 
service  should  not  have  to  wait  in  the  queue  for  messages 
that  take  more  time  to  service  unless  they  are  already  in 
the  process  of  being  serviced. 

2.5.3.  Calculation  of  Time  Delay  with  Priorities 

In  this  section  we  consider  the  realistic  case  in  which 
commanders  have  different  rank  and  desire  different  files. 
With  out  loss  of  generality  let  us  consider  the  one  file 
case.  In  our  application  we  have  L  commanders  and 
S  sensors.  The  commanders  have  different  rank  and 
different  commande.s  rate  the  importance  of  particular 


2.5  2.1.  Case  1 

This  is  the  case  of  0<p<L—  1: 


flp  -■ 


files  differently.  Therefore  the  importance  of  commanders 
and  importance  of  Bits  to  commanders  can  be  combined 
into  one  general  level  of  importance.  Combining  this  with 
the  importance  of  files  to  sensors  given  a  list  of  2L  + 
S  priorities.  The  reason  that  there  are  2L-fS  priorities 
is  because  commanders  can  query  and  update  files  while 
sensors  only  update,  commanders  can  request  data  on 
files  that  sensors  update  or  command  files.  Commanders 
can  also  update  command  files.  Sensors  only  update 
data  stored  on  files,  there  are  three  service  rates,  one 
for  commanders  query  1/rj,  one  for  commander  update 
1/p,  and  one  for  sensor  updates  1/f.  Since  the  service 
times  for  queries  are  assumed  to  be  negligible  the  queries 
always  have  the  highest  priorities,  the  service  time  for 
commanders  updates  is  much  less  than  that  for  sensor 
updates  since  changing  a  command  takes  less  time  than 
changing  data.  Therefore  the  commander  updates  have 
higher  priorities  than  sensor  updates.  The  arrival  rates  of 
commander  queries  to  node  i  is  X,,  the  arrival  rates  of 
commander  updates  to  node  i  is  and  the  arrival  rates 
of  sensor  updates  is  a,,  given  the  2L-f-S  priority  classes 
we  would  like  to  find  the  average  waiting  times  for  the 
different  classes.  This  result  is  found  in  Kleinrock’s  (1.2) 
book  "Queueing  Systems  II”  and  is  calculated  below.  The 
waiting  times  for  any  particular  class  is  given  by  Ilp: 


n,= 


y-iL+S  A,y* 

2^1-1  ~r 


(i  -  A<x;)(l  -  A,*;) 


2L+S 


(2.20) 


Using  the  above  equation  we  can  find  the  average  waiting 
times  in  the  queue  for  our  2I.+S  priority  classes.  Where 
A,  is  the  arrival  rate  of  the  itk  priority  class;  is  the 
average  service  time  of  the  ith  priority  class:  and  x?  >* 
the  second  moment  for  the  service  time  of  the  ith  priority 
class.  The  waiting  times  for  our  2L-+-S  priority  classes 
can  be  broken  down  into  six  different  cases.  The  six  cases 
are  as  follows:  0<p<L  —  1;  p  =»  L;  L  1  <p<2L  —  1; 
p  =  2 L;  2L<p<2L  +  S  —  1;  and  p  =»  2L  4-  5. 


The  service  rates  for  queries  is  zero  so  xf  and  Xi  are  zero 
for  l^*'</4 


'  (1  -  E..L+1  K  -  zttu x  <*,  j)(i  -  - 

(2.22) 

r^2L  &  i  a, 

ss  hlzJdL 1  ***  Z 
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3.  ACCESSIBILITY 

This  section  deals  with  the  problem  of  finding  the 
probability  of  accessibility  of  a  distributed  data  base 
system  in  a  fail'ng  computer  network.  Given  the  locations 
of  the  files  stored  in  the  system  along  with  the  probability  of 
link  and  node  failures,  one  can  determ.ne  the  accessibility 
of  a  user  to  any  particular  file  in  the  system.  A  user  is 
stationed  at  one  of  the  nodes  of  the  network,  while  any 
particular  file  may  have  multiple  copies  stored  throughout 
the  network.  Starting  with  the  link  and  node  failure 
probabilities  one  can  find  the  probabilities  of  inter-node 
accessibilities,  p,,’s.  An  inter-node  accessibilities,  p*,  is 
simply  the  probability  that  node  i  can  access  whatever  it 
at  node  j.  Once  the  inter-node  accessibilities  are  found, 
one  can  write  a  simple  form  for  the  accessibility  cf  a  user 
i  to  a  file  I  in  the  system,  P, /(/<),  in  terms  of  inter-node 
accessibilities  and  file  allocation  variables,  Xu.  Pu{h)  i* 
the  probability  of  accessibility  of  user  i  to  file  I,  given  the 
file  allocation  for  file  I  is  /j.  File  allocation  variables,  i,j, 
are  zero-one  stating  whether  or  not  file  1  is  stored  at  node 
i.  If  file  1  is  stored  at  node  i,  then  z,i  is  one;  on  the  other 
hand,  if  file  1  is  not  stored  at  node  i,  then  x,i  is  zero.  The 
purpose  of  finding  a  simple  form  for  P,i(It)  is  to  simplify 
the  task  optimal  file  allocation  optimization. 

3.1.  Introduction 

In  the  past  researchers  [3.3]  have  only  dealt  with  the 
inter-node  accessibilities.  This  is  because  one  simply 
wanted  to  find  the  accessibility  between  a  user  and  a 
service  node;  however,  in  the  context  of  optimal  file 
allocation  in  a  distributed  data  base  there  may  be 
more  than  one  service  node.  In  optimal  file  allocation 
in  a  distributed  data  base,  it  is  necessary  to  determine 
accessibility  between  users  and  their  desired  files  where 
there  may  be  multiple  copies  of  the  desired  files.  Mahmoud 
and  Riordon  [3.1],  and  Laning  and  Leonard  [3.2]  have 
only  dealt  with  approximations  of  accessibility  in  their 
optimal  file  allocation  optimizations.  They  only  have 
accessibility  issue  in  the  constraints  of  their  optimal  file 
allocation  optimizations.  In  *his  way,  their  formulation 
only  rejects  certain  file  allocations  because  of  the  low  level 
of  accessibility;  however,  they  do  not  try  to  maximize  the 
accessibility  between  the  users  and  the  files.  We  propose  to 
put  the  accessibility  issue  in  the  objective  function  of  our 
optimal  file  allocation  optimization;  therefore,  we  will  not 
only  minimize  the  cost  of  a  particular  file  allocation,  but 
also  maximize  the  probability  of  accessibility  of  user  to 
files.  In  our  formulation  we  will  use  the  exact  probability 
of  accessibility.  Once  the  probability  of  accessibility  is 
known  in  terms  of  the  allocation  variables,  zu,  and 
the  inter-node  accessibilities,  ptJ,  it  may  be  put  in  the 
objective  function.  Since  this  term  will  be  non-linear  in 
the  allocation  variables,  it  is  desired  to  find  a  simple  form 
of  the  probability  of  accessibility  so  that  one  may  easily 
linearize  the  objective  function.  First  one  mu  T  find  the 
inter-node  accessibilities. 

3.2.  Inter-Node  Accessibilities 

The  term  p,y  represents  the  probability  of  accessing  nods 
j  from  node  i  and  is  called  the  probability  of  inter-node 
accessibility.  Many  other  researchers  have  dealt  with  the 
problem  of  inter-node  accessibilities.  Hansler,  McAuliffe, 


and  Wilkov  [3.3]  have  found  an  exact  method  of  finding 
there  inter-node  accessibility.  Their  algorithm  first  finds 
a  collectively  exhaustive  list  of  primal  cutsets  of  the 
network.  A  primal  cutset  arc  a  set  of  links  when  removed 
denies  accessibility  between  the  desired  nodes.  The  next 
step  is  to  transform  the  list  to  a  mutually  exclusive  and 
collectively  exhaustive  list  of  primal  cutsets.  Then  the 
sum  of  the  probabilities  of  failure  of  each  primal  cutset 
is  the  probability  of  non-accessibility  between  the  nodes. 
This  is  the  complement  of  the  intcr-nodr  accessibility 
for  any  two  nodes.  This  may  be  extended  to  more  than 
two  nodes,  since  the  probability  of  accessibility  between 
three  nodes  can  not  be  easily  derived  from  the  probability 
of  accessibility  between  two  nodes  due  to  the  possibility 
of  dependence  of  link  failures.  Hansler,  McAulifTe,  and 
Wilkov  [3.3]  state  that  although  the  algorithm  could  take 
exponential  time  to  evaluate,  he  states  that  the  algorithm 
can  handle  networks  as  large  as  the  Arpanet. 


3.3.  Accessibility 


This  section  deals  with  the  problem  of  finding  the 
probability  of  accessibility  of  a  distributed  data  base 
systems  in  a  failing  network.  Given  the  location  of  the  files 
stored  in  the  system,  we  can  determine  the  accessibility 
of  a  user  from  any  particular  node  to  a  particular  file  in 
the  system.  Each  file  may  be  stored  a  multiple  number  of 
times.  We  present  a  simple  form  for  the  accessibility  of  a 
user  to  a  file  in  the  system. 


Let  us  consider  a  distributed  data  base  with  only  one  fiie 
stored  a  multiple  number  of  times.  For  our  problem  we 
can  consider  the  one  file  system  without  loss  of  generality 
since  we  are  only  concerned  with  the  accessibility  of  a  file 
to  t.  user.  We  wish  to  find  P,(f)  the  probability  of  user  i 
accessing  the  file  with  a  particular  file  allocation  I.  This 
probability  may  be  written  as  follows: 


P>U)—  £  Ilf1  —  xk)*3Pi] 
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(3.1) 

Where  z,  is  a  zero-one  variable  indicating  whether  the  f,le 
is  stored  at,  node  i  or  not.  If  the  value  of  z,  is  one  the  file 
is  stored  at  node  i;  on  the  other  hand,  if  the  value  is  zero 
the  file  is  not  stored  at  node  i.  Pl]  js  the  probability  of 
accessibility  between  nodes  i  and  j. 
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The  first  term  is  the  probability  of  accessibility  of  user  i 
to  the  file  stored  at  node  j  given  the  file  is  only  stored  at 
node  j.  The  second  term  is  the  probability  of  accessibility 
of  user  i  to  the  file  given  the  file  is  stored  only  at  nodes 
j  and  h.  This  probability  is  simply  the  probability  that 
user  i  can  access  the  file  from  node  j  or  the  probability 
that  user  i  can  not  access  the  file  at  node  j  but  can  access 
the  file  at  node  h.  Continuing  in  this  fashion  we  arrive  at 
the  last  term  which  states  that  the  file  is  stored  at  all  the 
nodes,  so  the  probability  of  accessibility  of  ustr  i  to  the 
file  then  becomes  the  probability  that  user  i  can  access 
the  file  from  node  j;  or  the  probability  that  user  i  can 
access  the  file  from  node  h  but  not  from  node  j;  or  the 
probability  that  user  i  can  access  the  file  from  node  g  but 
not  from  node  j  and  node  h;  or  etc.. 

We  wish  to  rewrite  this  probability  in  a  more  simplified 
form.  A  more  simplified  form  is  one  that  requires  fewer 
multiplications  and  the  first,  second,  third,  etc.,  and  higher 
order  terms  are  separated  into  different  terms.  One  would 
like  the  first,  second,  third,  etc.,  and  higher  order  terms 
to  be  separated  into  different  terms  because  then  it  is 
simple  to  linearize  the  zero-one  integer  program  that  is 
required  for  optimal  file  allocation,  no  matter  if  P,(I)  is  in 
the  constraint  or  in  the  objective  function.  The  simplified 
form  will  enable  one  to  see  any  special  structure  in  the 
equation.  The  form  that  we  desire  is  recursive  > nature, 
in  the  sense  that  the  probability  of  accessibility  for  N 
nodes  is  just  a  simple  relation  from  the  probability  of 
accessibility  for  N-l  nodes.  The  form  we  desire  is  the 
following: 


P *1)  —  £  x]P'i  —  ••• 

N  N  N 

+  M)W+'EE  n 

:sj 
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(3.2) 

where  px,  is  the  probability  of  accessibility  between  nodes 
i  and  j.  Let  the  probability  of  accessibility  between  user  i 
and  the  file  for  an  N  node  network  be  P(N).  Then  using 
the  above  formula  we  may  rewrite  P(N)  as: 

P{N)  =  znP.n[1  -  P(N  -  1)]  (3.3) 

The  proof  of  the  equivalence  is  done  in  the  proof  of 
the  theorem  in  the  appendix.  The  theorem  is  simply 
the  equivalence  of  the  two  forms  of  the  probability  of 
accessibility. 

With  this  form  of  the  probability  of  accessibility  it  is  easy 
to  see  the  impact  of  adding  another  node  >o  the  system. 

3.4.  Sigt  ’.ficance  Of  Results 

We  have  been  able  to  find  a  simple  and  exact  form  for 
the  probability  of  accessibility  between  a  user  and  a  file 
that  we  are  trying  to  allocate  in  our  optimal  file  allocation 
optimization.  A  proof  of  the  equivalence  of  the  simple 
form  and  the  definition  of  the  probability  of  accessibility 


is  given.  The  probability  of  accessibility  can  be  found 
in  terms  of  the  probability  of  inter-node  accessibilities 
and  the  allocation  variables.  In  turn  the  probability  of 
inter-node  accessibilities  can  be  found  in  terms  of  the 
node  and  link  failure  probabilities.  Therefore  we  can 
find  an  expression  for  the  probability  of  accessibility  in 
terms  of  the  node  and  link  failure  probabilities  and  the 
allocation  variables.  This  probability  of  accessibility  can 
be  placed  in  our  objective  function  of  our  optimal  file 
allocation  optimization  to  not  only  minimize  costs  but 
also  to  maximize  the  probabilities  of  accessibility.  Once 
this  is  done  the  non-linear  integer  program  to  solve  our 
optimal  file  allocation  can  be  linearized  to  a  linear  zero- 
one  program.  The  problem  now  remains  to  develop  a  fast 
algorithm  to  solve  our  linear  zero-one  program  for  optimal 
file  allocation  of  a  distributed  data  base  in  an  unreliable 
network  using  exact  probabilities  of  accessibility. 

3.5.  Non-Arcessihility  Costs 

There  arc  two  types  of  non-accessibility  costs.  There  is 
the  commander's  non-accessibility  costs  and  the  sensor’s 
non-accessibility  costs.  The  commander’s  non-accessibility 
cost  is  due  to  the  commander’s  desire  for  information 
stored  in  the  Distributed  Data  Rase,  there  is  a  cost 
associated  with  the  commanders  not  being  able  to  access 
the  information  due  to  link  failures.  This  models  the 
commander's  accessibility  costs.  In  the  case  of  the  sensors, 
multiple  sensors  may  br  feeding  data  into  a  particular  file 
(FUSION).  This  data  may  be  unusable  if  it  is  separate. 
Therefore,  the  data  must  be  fused  together.  This  may  i>ol 
be  possible  due  to  link  failures.  There  is  2  cost  sssscislvd 
with  a  sensor  not  being  able  to  update  the  needed  file,  this 
models  the  sensor’s  to  files  accessibility  costs,  for  example, 
if  one  sensor  is  tracking  an  object  and  another  sensor  is 
trying  to  identify  the  object,  then  it  is  dear  the  sensor’s 
data  individually  may  mean  nothing,  but  combined  they 
can  be  very  important. 

Let  us  define  the  non-accessibility  costc  that  we  are  trying 
to  minimize  in  our  formulation,  let  us  define  the  non¬ 
accessibility  cost  as  the  negative  value  of  the  probability 
of  accessibility  weighted  by  the  importance  of  the  files, 
commanders  and  (or)  sensors.  This  has  an  averaging  effect. 
In  other  words  the  formulation  will  try  to  allocate  the 
files  such  that  the  more  important  the  file  the  higher 
the  probability  of  accessibility,  let  us  therefo-*  try  to 
write  down  some  equations  relating  these  facts  to  the 
non-accessibility  of  commanders  and  sensors. 

3.5.1.  Commanders 

Let  us  define  what  are  our  unacccssability  costs  for 
commanders. 

We  define: 

h(R{N,))  =  (~4) 

which  denotes  the  accessibility  of  file  1  to  th*  commander 
at  node  i  weighted  by  the  importance  of  file  1  to  the 
commander  at  node  i,  /?(,.  /?(.Y„)  is  the  set  of  nodes 
accessible  to  node  i.  0u  is  the  importance  of  file  1  to 
commander  i.  A,(l)  is  a  zero-one  variable  denoting  whether 
the  file  1  is  accessible  to  node  i;  therefore,  A,(i)  is  one  if  the 
file  is  accessible  and  zero  if  the  file  is  inaccessible.  A  file 


.is  inaccessible  either  if  there  is  not  a  path  from  node  i  to 
a  node  which  file  1  is  stored  or  if  all  the  nodes  containing 
file  I  arc  destroyed.  Let  us  define  the  unaccessability  costs 
for  commanders  as: 

'  L 

-E  £  atI,(R(Nt))  (3.5} 

«»i 

In  other  words  the  unaccessability  costs  is  the  negative 
expected  value  of  accessibility  weighted  by  the  importance 
of  the  commander  a,  and  the  importance  of  tbe  file  to  the 
commander. 

If  we  now  examine  the  last  term  in  the  minimization,  we 
can  simplify  the  expression.  The  expected  value  may  be 
brought  inside  the  summation.  Since  the  importance  of  the 
commander  i  and  the  importance  of  file  1  to  commander 
i  are  not  probabilistic,  we  can  simply  take  the  expected 
value  of  the  accessibility.  However  the  expected  value  of 
the  accessibility  is  simply  the  probability  that  commander 
i  can  access  file  1  given  the  allocation  of  redundant  copies 
of  file  I  in  the  network.  We  hive: 

e  £  Q,/i(H(N,))|  =  Je  ; 

1<-1  J  L<- 1 

=  £  o.A£[A(I)];  (3S) 

1 

-  E  *i0iiPuii 0- 

«« i 

where 

1  if  Pr{3ks.t.lat  KkeR(N,)) 

P,,[h )  =  0  if  Prtffc  si.  I  at  NkqR[Nx)) 

0  if  Pr(Vk  s.t.  I  at  Nk,  Nk  destroyed) 

(3.7) 

Where  P,i(Ii)  for  one  file  1,  was  found  in  the  previous 
section  to  be: 


PiV)-  E  no -**)*,*] 
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N  N  N  \ 

+  E  E  n  (  —  Zk)X]Xh{p>i  4  (1  —  ••• 


1  *-1 


+EE - E  n 

a—I  *  =  ■ 

*>•  j> U 
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J>k  kyi> 
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**[ Pi]  4(1  —  Pxj)Ah  +  (1  —  P.;)(l  —  Pih)Pi,  •  •  •  1 
J  (3.5) 

which  simplifies  to  the  following: 


N 

Pi(t)=  £  xjPij  —  ... 

J-l 

N  N  N 

+HPEE  n*mPi« 

j»l  *— I 
*  *>l  "***> 

+  (—  1)N  E  II  XmPxm  4-  (— 1)JV+1  n  XmPim 

i-i ;;;  »-i 

(3.9) 

where  p,y  is  the  probability  of  accessibility  between  nodes 
i  and  j.  Substituting  the  above  equation  into  the  non- 
accessibility  cost  of  the  commander  gives  us  the  final 
expression  for  the  non-accessibility  costs  for  commanders. 

3.5.2.  Sensors 

The  question  is  how  do  the  sensors  effect  our  problem 
of  optimal  file  allocation  in  an  unreliable  network.  The 
problem  is  that  different  sensors  must  fuse  their  data 
in  data  files  so  that  their  information  becomes  useful; 
whereas  the  data  from  the  sensors  individually  is  not 
as  useful.  There  turns  out  to  be  a  large  difference  if 
the  data  in  the  different  sensors  are  fused  or  not  fused. 

This  is  due  to  military  applications  where  one  sensor 
may  be  tracking  position  while  the  other  sensor  may  be 
determining  identity.  In  this  case  it  is  very  important  to 
match  this  data  together,  since  if  one  docs  not  know  the 
identity  of  the  object  that  one  is  tracking  he  docs  not 
know  what  action  to  take. 

Our  problem  is  how  to  model  this  aspect  of  sensors. 

This  can  be  modeled  in  a  similar  way  as  we  modeled 
commanders.  There  will  lie  an  importance  of  a  file  to  a 
sensor.  Then  to  model  the  fact  that  the  sensor  data  must 
be  fused  together  we  assign  an  accessibility  variable,  which 
is  non-zero  only  if  the  file  is  accessible  to  both  sensors. 

There  are  S  sensors  in  the  system  then  the  term  that  is 
to  be  added  to  the  formulation  is  of  the  following  form: 


£[i  £  *.'<(*(‘Vt)nrt(iv,))j « £  vujAAi) ; 

s  s 

=  e  e^4^(4 

S  S 

=  E  E  UuiPM 

«*=i  y— i 

(3.10) 

where  UU}  is  the  usefulness  of  file  1  when  the  data  from 
sensors  i  and  j  are  fused  there;  and  A,;(f)  is  defined  below: 


j  1  if  3k  s.t.  I  at  NkeR{N,)  f)  R[Nj) 

^.j(0=|O  if  ^k  s.f.  I  at  Nk£R(Nt)  fl  H(1V,)  (3.11) 

(0  if  Vfc  s.t.  I  at  Nk,  Nk  destroyed 


Where  Px]i{li)  for  one  file  1  is  by  definition: 


p.Mi)  = 


(4-3) 


C(/)>C(/~(il)  Jot  each  z&X. 

Then  for  every  sequence  /<('),  1{^\ . . I&\  which  are  sub¬ 
sets  of  X,  such  that  /<(*)  has  k  elements  and  li M  3/j(k  +  I)j 
the  following  is  true: 


C(/)  >  C(/~  /?( ’> )  >  C(/~/?'2>)  >  C(I-R'3) )  > . .  .<7(/~ft(r>) 

(44) 

4.4.  Theorem  IV  in  words  and  its  implications 

Theorem  IV  states  that  if  a  given  vertex  has  a  cost  greater 
than  the  cost  of  any  vertex  along  the  path  leading  from  it, 
then  the  sequence  of  costs  encountered  along  any  one  of 
these  paths  increase  inonotonically.  Thus  in  order  to  find 
the  optimal  allocation  policy,  it  is  sufficient  to  follow  every 
path  of  the  cost  graph  in  the  reverse  direction  until  the 
cost  decreases  and  no  further.  This  will  give  an  allocation 
of  local  optimum  of  which  the  global  optimum  is  one  of 
them. 


_  t  s 

+  E  cki  ~  E  *i0hPti(h)  —  E 

*ca  . -i  ,“1 

(3.13) 

4.  NEW  THEOREMS 

The  new  theorems  presented  here  along  with  theorems  III 
and  IV  or  last  year’s  paper  [4.1)  show  that  the  objective 
function  is  convex  in  the  number  of  redundant  file  copies. 

Tsing  this  result,  we  may  find  an  efficient  heuristic 
algorithm  to  solve  the  optimal  file  allocation  problem  by 
an  efficient  search  for  the  optimal  number  of  redundant 
file  copies.  4.5.  Theorem  XV 

4.1.  Theorem  XIII  As  the  number  of  redundant  copies  of  a  file  increases  the 

objective  function  is  convex.  In  other  words  (he  objective 
function  first  decreases  to  a  local  minimum,  then  increases 
as  the  numner  of  redundant  file  copies  increase. 

C’(/~lfcl)>C(/~(l,2])  fork  =  1,2.  (4.1) 

*  hcorem  \\  in  words  and  its  implications 

Theoi cm  XV  simply  states  that  the  objective  function 
is  convix  in  the  number  of  redundant  file  copies.  This 
will  enable  us  to  develop  algorithms  that  will  solve  our 
formulation.  The  worst  case  or  our  algorithm  will  be  better 
than  any  previous  algorithm. 


Theorem  XIII  states  that  on  our  cost  graph  if  two 
doccndcnts  of  a  given  vertex  have  a  cost  greater  laan 
the  cost  of  the  vertex,  then  the  cost  of  the  dcccndcnts  of 
the  two  vertices  is  greater  than  cither  or  the  two  vertices. 

4.3.  Theorem  IV 

Given  an  index  set  X'JI,  containing  r  elements  with  the 
following  property: 
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then 

C(I)>C(l~~\k\)  fork  =1,2,  (4.2) 

4.2.  Theorem  Xlfl  Sn  words  and  its  implications 


This  allows  us  to  reduce  the  solution  space  or  the  integer 
program.  Once  we  find  a  local  optimum  then  we  know 
that  any  more  file  allocation  is  not  required  so  the 
integer  program  will  not  have  to  search  for  solutions  in 
that  part  of  the  solution  space. 
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where  pt}k  is  the  probability  of  accessibility  between  nodes 
i  and  k,  and  nodes  j  and  k.  The  problem  with  this  new 
formulation  is  that  it  is  extremely  difficult  to  find  the 
P,,*’s  since  p,„  can  not  be  found  easily  from  p,k  and  p,k 
since  these  tvs  probabilities  are  not  independent.  These 
two  probabilities  are  not  independent  because  the  path 
taken  from  node  i  to  node  k  may  cross  the  path  t  >kcn  from 
node  j  to  node  k.  The  final  reformulation  is  as  follows: 


m,in  E  c(h)  = 

''  i=t 


5.  LINEARIZATION 
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r  n  r  n 

EE 

li-i  [fc-i 


'I’jldjktlk  +  min  \jtdjk 


N  L 

+  52  aktxlk  —  52  aiP-iPilW 

*-l  t-1 

(5.1) 


The  problem  with  the  formulation  stated  above  is  that 
there  are  two  non-lincar  terms  in  the  formulation.  These 
terms  will  be  seen  to  be  non-linear  in  the  x’s  which 
are  sero-onc  integers.  Theiefore  our  formulation  as  it 
currently  stands  is  a  non-linear  zero-one  integer  program. 
However  as  it  will  soon  be  seen  that  a  non-linear  zero- 
one  integer  program  can  be  changed  to  a  linear  zero-one 
integer  program  with  the  addition  of  some  simple  linear 
constraints.  The  two  terms  that  are  non-linear  are  listed 
below: 


2)  £  *,0uPdh) 

im  1 


(5.2) 


(5.3) 


The  non-linear  portion  of  the  above  non-linear  terms  is 
simply: 


l)min 


2  )PM) 


(5.4) 


(5.5) 


Now  without  loss  of  generality  we  can  assume  the  case  of 
only  one  file,  so  let  /  =  /;.  With  this  our  two  non-linear 
terms  are  as  follows: 


l)min  X,  djk 


2)  52  aMV 


(5.6) 


(5.7) 


In  the  second  equation  the  non-linear  term  is  /*<(/),  which 
has  been  proven  in  theorem  XII  to  be: 


pM=  E  xlP>3-  ■■■ 

>«» 

N  N  N 

■+■  ( — 5  E  II  ZmPtm 

•  — 1  >■!  *«  — l 

*  *  *>J  •*•#> 

wt  jrf* 

+  ( — 1)^  53  II  x^Pim  4*  ( — l)^^1  IJ  xmPim 

j=*i  "»-■  m=*l 

(5.8) 

Here  clearly  there  are  non-linear  terms  in  each  sum  except 
the  first  sum.  The  non-linear  terms  are  in  x.  Now  lets  see 
if  we  can  also  find  an  equivalent  representation  for  the 
first  non-linear  term. 


• 


The  first  non-linear  term  is  the  minimization  over  all  I  of 
However  the  terms  X;  and  d/*  are  simply  the  query 
traffic  emanating  from  node  j  and  the  communication 
cost  from  node  j  to  the  file  at  node  k  respectively.  These 
terms  are  known  apriori,  therefoir  it  should  be  possible  to 
represent  the  minimization  as  a  series  of  non-linear  terme. 
This  is  indeed  the  case.  Without  loss  of  generality  let  us 
suppose  that  we  can  reorder  the  product  of  these  terms 
from  smallest  to  the  largest,  with  the  smallest  having  the 
smaller  index  k.  Then  let  us  define  the  following: 


Pjk  — 


(5.9) 


Now  assuming  that  the  p}*’s  arc  ordered  in  increasing 
order  we  can  represent  the  minimization  as  a  sum  of 
non-lincar  terms  as  follows: 


min  X.d.fc : 
*€/  3  3 


1131X1 + M3i(1  -*«)*»+•••  *,n  n'd  -  Xm)x„ 


(5.10) 

Now  once  again  we  see  that  these  terms  are  once  again 
non-lincar  in  terms  of  the  zero-one  x’s  of  of  integer 
program. 

To  reduce  the  above  non-lincar  zero-one  problem  to  a 
linear  zero-one  problem,  we  first  consider  the  objective 
function.  Let  us  define: 


Xij..,n y  —  At Xj  .  .  .XyXy  q  —  2,  .  .  .  ,Q  (5.11) 

which  takes  value  zero  or  one,  where  Q  is  the  highest 
degree  of  nonlinearity.  We  then  represent  each  nonlinear 
term  in  the  objective  function  by  terms  of  the  above  form 
and  then  examine  its  coefficient.  If  the  coefficient  of  the 
non-linear  term  is  positive,  we  introduce  the  following 
constraint  equation: 


A)  -f-  Xj  -f-  ...  +  X%  +  Xv  —  q  •+■  1  <  (5.12) 

If  the  coefficient  of  the  non-linear  term  is  negative,  we 
introduce  the  following  constraint  equation: 


Xi+Xj+  ...+X%  +  XV>  qXij...w 


(5.13) 
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algorithms 


i.t.  Algorithm  I 

5«r  objective  function  has  been  shown  to  be  convex  m 
Jie  number  of  redundant  file  copie*  in  theorem  XV.  Since 
jur  objective  function  i*  convex,  one  can  do  a  search 
>n  the  number  of  redundant  file  copies  using  the  golden 
•alio  to  find  the  optimal  solution,  while  using  the  linear 
programming  integer  solutions  from  the  reliable  network 
solution  as  a  starting  point.  Let  N  be  the  number  of  nodes 
in  the  network. 

The  steps  of  the  algorithm  are  as  follows: 

1.  Set  the  maximum  number  of  file  copies  to  r,  from 
theorem  1  and  II.  Find  the  cost  for  r  optimally 
placed  copies  with  cost  Ir 

2.  Eliminate  all  Q  nodes  that  cost  too  much 

3.  Automatically  add  M  nodes  that  cost  so  little  so 
as  to  always  be  profitable 

4.  Find  the  linear  programming  solution  to  the 
underlying  reliable  network.  The  number  of 
integer  file  allocations  to  the  linear  program 
is  1’.  Use  these  P  file  allocations  as  an  initial 
file  allocation.  Find  the  cost  of  the  optimal  file 
allocation  for  P  redundant  copies.  This  cost  is 

5.  Find  the  optimal  file  allocations  for 

(6i) 


At  =  P  -f-  tnlejeri 


(r—PP-v/S), 


(6.2) 


redundant  file  allocations  with  cost  Ib. 

6.  If  A\—Ai,  then  is  the  optimal  number 

of  redundant  file  copies.  Simply  use  the  optima) 
file  allocations  for  Ai*=Ai  redundant  file  copies. 

7. 

7.1.  If  Ip  =  max[/^,  U,  Ib,  In]  go  to  sup  8 

7.2.  If  7r  =  ma x[/p,/a,  Ir,Ir]  go  to  sup  9 

8.  Set  P=Ai  and  /p==/x.  Find  the  optimal  fil* 
allocations  for 


A, 


(r-P)(Vs-  1) 


(6.3) 


redundant  file  allocations,  with  cost  /a-  Go  to 
step  6. 

9.  Set  t—Ai  and  Ir-Ir-  Find  the  optimal  file 
allocations  for 


(r-p)(3-s/S) 
A, - - - 


(6.4) 


redundant  file  allocations  with  cost  I  a-  Find  the 
optimal  file  allocations  for 


min  X.d«>  *=  P)iz\  +  Mya(l  —  *i)*a  +  •  •  •  My N  II  (l  ~  *»)*w 


redundant  file  allocations,  with  cost  Ib-  Go  to 
step  6. 

Since  the  objective  function  is  convex,  the  above  algorithm 
must  converge  to  the  solution. 

6.2.  Algorithm  U 

For  the  totally  reliable  network  let  us  examine  the  non¬ 
linear  term: 


N-l 


A,  J 


*€/ 

Let  us  redefine  the  variables: 

*i  -  0  -  *l) 

*J  *  (!  -  **) 

=  (1  —  *rv) 

then  the  non-linear  term  becomes: 


m=»I 

(6.5) 


(6.6) 
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Now  let's  linearise  the  equation  except  the  last  term 
adding  the  appropriate  constraints. 


MjI+fPj*— /*j|)»1+(Mj3— h,(N~  I ))V  123 ...  JV  —  1 

(6.10) 

*1  •+•  +  2  —  1  <  y\i 

*l+*2  +  *3  +  3— (6-11) 

*1  H-  **  +  •  •  •  *Ar  -l  <  ViJ3 ... N- 1 

The  reason  we  can  expand  this  in  this  manner  is  that 
(/j;(,+  H  —  ftji)  is  always  positive  since  they  are  ordered  in 
ascending  order.  Therefore  the  coefficients  of  the  non-linear 
terms  are  always  positive  and  the  non-linear  terms  may 
be  linearized  using  the  above  constraints  The  constraint 
matrix  is. 


(\  0 

0  . 

.  0  —1 

0  0  . 

•  0  ) 

1 1 

0  . 

.  0  0 

-1  0  . 

0 

1 1 

1  . 

.  0  0 

0  —1  . 

.  0 

(6.12) 

a  1 

1  . 

.  1  0 

0  0. 

.  -J 

We  want  tc  show  that  this  matrix  is  unimodular.  Then 
si-nply  solve  the  problem  in  the  Vs  and  take  the  inverse 
to  find  the  x’s.  This  problem  can  be  solved  in  polynomial 
time  since  the  constraint  matrix  is  unimodular.  It  can 
also  be  solved  using  the  simplex  method  which  will  yield 
only  integer  solutions.  The  simplex  method  needs  to  be 
calculated  twice  since  the  highest-order  non-linear  term 
has  a  negative  coefficient  and  cannot  be  expanded  in  this 
manner.  Therefore  once  we  form  tl.e  constraint  equations 
that  linearize  the  formulation,  we  solve  the  problem  with 
the  simplex  method  once  with  the  highest-order  non-uuear 
term  taking  the  value  zero  and  once  with  the  value  one. 


6.2.1.  Steps 

These  are  the  steps  of  the  algorithm: 

1.  Eliminate  all  Q  nodes  that  cost  too  much 

2.  Automatically  add  M  nodes  that  cost  so  little  so 
as  to  always  be  profitable 

3.  Solve  the  problem  with  the  simplex  method  y 
twice,  once  for  the  highest  non-linear  term  zero 
and  once  with  the  the  highest  non-linear  term 
one.  The  minimum  of  these  two  results  is  the 
optimal  solution. 

7.  CONCLUSIONS 

We  have  formulated  the  file  ailoeation  problem  in  a  C3 
context  where  vulnerability  is  an  issue.  We  consider  the 
problem  of  optimally  locating  multiple  copies  or  files  in  an 
unreliable  computer  network  with  significant  time  delays. 
Our  formulation  considers: 


1.  The  probability  of  commander  accessing  files; 

2.  The  probability  of  multiple  sensors  accessing 

3.  The  importance  of  commanders; 

4.  The  importance  of  sensois; 

5.  The  importance  of  particular  files  to  particular 
commanders. 

6.  Variable  routing; 

7.  Priority  queueing; 

8.  Time  delays. 

The  theorems  have  provided  ways  to  cut  down  on  the 
possible  fiic  allocations  (solution  space)  in  which  the  integer 
program  has  to  search.  Therefore,  we  reduce  the  amount 
of  time  requited  to  solve  for  a  solution  using  integer 
programming. 

We  have  extended  and  proved  fifteen  theorems,  all 
r.pplicable  to  the  new  formulation.  These  theorems  reduce 
the  search  space  of  the  integer  program  and  also  prove 
that  the  objective  function  is  convex  in  the  number  of 
redundant  file  copies. 

We  have  shown  how  to  linearize  the  formulation  to  produce 
a  linear  zero-one  integer  programming  formulation. 


We  have  developed  two  fast  algorithms  to  solve  the  problem 
of  optimal  file  allocation  in  the  reliable  and  unreliable 
networks.  For  the  reliable  network  problem  our  algorithm 
finds  a  solution  in  polynomial  time.  This  is  a  great  advance 
over  previous  algorithms  which  all  requi.ed  heuristic  non¬ 
polynomial  time  algorithms.  For  the  unreliable  network 
our  heuristic  algorithm  takes  advantage  of  the  structure 
of  the  objective  function  of  our  problem.  In  the  theorems 
presented,  we  have  shown  that  the  objective  function  is 
convex  in  the  number  of  redundant  file  copies.  Using  this 
result  we  can  simply  use  a  -earch  based  on  the  golden  ratio 
which  will  find  the  optimal  number  of  redundant  file  copies. 
This  algorithm  will  never  need  an  exhaustive  search, 
so  it  is  superior  to  all  integer  programming  solutions. 
This  algorithm  will  also  find  an  exact  solution  which  is 
superior  to  all  other  heuristic  algorithms  that  just  find 
approximations  to  the  solutions. 

Further  work  will  be  concentrated  on  simulating  these 
algorithms  on  a  computer.  We  will  also  conduct  scnsitivjty 
analysis  on  these  results  for  networks  up  to  fifteen  nodes. 
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ABSTRACT 

In  this  paper,  we  consider  the  random-access  of  a 
single  slotted  channel  by  a  finite  number  of  indepen¬ 
dent,  data  transmitting  bursty  users.  We  adopt  the 
assumption  that  each  user  monitors  the  channel  only 
while  he  is  blocked.  We  also  assume  that  the  channel 
outcomes  (visible  to  each  user)  are  binary.  That  is, 
each  channel  slot  is  perceived  as  either  a  noncollision, 
or  as  a  collision  slot.  We  disregard  propagation  de¬ 
lays.  For  the  above  model,  we  propose  and  analyze  a 
collision  resolution  protocol  (BCRLS)  with  tree  search 
characteristics.  For  identical  users  with  binomial 
transmission  processes,  we  find  lower  bounds  or.  the 
BCRLS  througnput,  and  we  compute  upper  bounds  on  the 
induced  delays  in  transmission.  We  compare  our  results 
with  those  induced  by  the  dynamic  tree  protocol  of 
Capetanakis;  where  the  feedback  sensing  is  continuous 
in  the  latter. 

I.  INTRODUCTION 

We  consider  a  number  of  data-transmitting  bursty 
users  who  request  access  to  a  single  network  resource. 

We  assume  that  the  users  do  not.  communicate  with  each 
other  directly,  and  that  their  data  are  formatted  into 
packets  of  identical  length.  Such  a  user  model  arises, 
for  example,  when  a  number  <.  f  computer  terminals  access 
a  single  host  computer.  Let  us  further  assume  that  the 
network  resource  is  a  single  transmission  channel  whose 
time  is  slotted.  The  length  of  each  channel  time  slot 
is  equal  to  one  packet.  Also,  each  user  can  attempt 
transmission  of  a  packet,  starting  only  at  the  beginning 
of  a  slot.  Given  the  general  model  above,  a  variety  of 
transmission  protocols  can  be  devised  depending  on  the 
specific  characteristics  of  the  user  end  channel  models. 
Such  characteristics  include  finite  number  of  well- 
identified  users  versus  an  asymptotically  large  number 
of  ill-specified  users,  as  well  as  various  levels  of 
feedback  information  provided  to  the  users  by  the 
channel ,  [11'/]. 

In  the  present  paper,  we  assume  finitely  many 
users,  and  we  adopt  similar  user  and  channel  models  as 
in  J4,6].  In  contrast  to  those  models  as  well  as  to 
the  col lision-f ree  models,  however,  we  assume  that  each 
user  inspects  the  broadcast  feedback  only  while  he  is 
blocked.  By  blocking  we  mean  the  existence  of  some 
unsuccessfully  transmitted  packet  in  the  user's  buffer. 
Our  assumption  is  the  same  as  in  [14],  and  it  eliminates 
the  often  undesirable  requirement  that  all  users  monitor 
c^e  channel  constantly,  even  when  empty.  In  contrast  to 
[14],  we  assure  that  the  feedback  is  binarv,  distin¬ 
guishing  between  collision  and  no:. collision  slots.  For 
the  above  model  we  propose  and  analyze  a  collision 
resolution  protocol.  We  name  this  protocol  Binary 
Collision  Resolution  with  Limited  Sensing  (BCRLS). 


This  work  was  supported  jointly  by  the  National  Science 
Foundation  under  the  Grant  ECS-8H9335i  and  the  Air 
Force  Office  of  Scientific  Research  under  Grant  AFOSR- 
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II,  THE  BCRLS  PROTOCOL 

As  in  [6],  we  assume  that  2n  identical,  indepedent 
ana  packet-transmitting  users  share  a  single  slotted 
channel.  If  a  single  packet  is  transmitted  within  a 
slot,  ic  is  received  correctly.  If  at  least  two 
packets  are  simultaneously  transmitted,  a  collision 
occurs,  all  information  contained  in  the  collided 
packets  is  lost,  and  retransmission  is  necessary.  The 
binary  outcome  of  each  slot  (empty  or  busy  with  one 
packet,  collision)  is  broadcasted  to  all  users  without 
propagation  delays.  Let  the  system  start  operating  at 
time  zero,  and  let  time  he  measured  in  slot  units. 
Initially,  each  user  does  not  inspect  the  feedback,  and 
he  is  free  to  transmit  a  packet  in  any  slot.  Let  some 
user  transmit  his  first  packet  at  time  t.  He  then 
inspects  the  feedback  that  corresponds  to  slot  t.  If 
he  sees  success,  he  stops  inspecting  the  feedback  until 
his  next  transmission.  If,  instead,  he  sees  a  colli¬ 
sion,  he  initiates  the  BCRLS  protocol  for  collision 
resolution,  while  inspecting  the  feedback  continuously. 
The  user  perceives  the  collision  as  resolved,  as  soon 
as  his  collided  packet  is  successfully  transmitted.  He 
then  stops  inspecting  the  feedback,  until  his  next 
transmission.  Transmission  of  new  arrivals  is  not 
attempted  by  the  user,  until  his  own  collision  has  been 
resolved.  In  this  section,  we  will  briefly  describe 
the  BCRLS  protocol  and  we  will  analyze  its  operational 
characteristics.  Since  those  aspects  are  independent 
of  the  packet-generating  process  per  user,  we  will  make 
no  assumptions  on  the  latter  at  this  point.  We  will 
nted  such  assumptions,  only  when  we  study  the  expected 
delays  induced  by  the  protocol,  and  its  stability 
properties.  More  details  on  the  BCRLS  can  be  found  in 
[18]. 

A.  The  ECRLS  General  Operation 

Given  2n  users,  consider  the  binary  tree  with  2n 
leaves.  The  tree  has  n+1  levels  of  depth,  numbered 
from  0  to  n.  Depths  0  and  n  correspond  respectively  to 
the  root  and  the  leases  of  the  tree.  In  general,  there 
exist  2i  nodes  at  depth  i.  Each  of  these  noides  is  the 
root  of  a  binary  subtree  with  2n”  leaves.  Each  tree 
node  beyond  depth  0  is  identified  by  a  binary  codeword, 
where  the  codeword  of  each  node  at  depth  i:i>l  contains 
i  oitc.  Consider  the  depths  of  the  binary  tree 
evolving  sequentially  from  left  to  right  (as  in  figure 
TI.l),  and  let  some  node  at  depth  i:l£i£n-l  be  identi¬ 
fied  by  the  binary  codeword  Then,  the  two 

nodes  at  depth  i+1  that  branch  off  node  X]X2***xi  are 
identified  by  the  codewords  x^.-.x^O  and  x^^.x^l;  node 
x1...xil  lies  under  node  x^.-.x^O,  The  two  nodes  at 
depth  1  that  branch  off  the  tree  root  R  are  identified 
by  the  length  one  codewords  0  end  1,  where  node  1  lies 
under  node  0.  It  is  clear  from  the  above  that  each 
binary  codeword  xj . .  .:;j  :I<i<n  identifies  a  single  tree 
node  at  depth  i;  thus,  it  also  identifies  the  unique 
path  that  connects  this  node  with  the  tree  root.  In 
particular,  ea«_n  one  of  the  2n  distinct  binary  code¬ 
words  of  length  n  identifies  a  single  tree  leaf.  Con¬ 
sider  now  a  one-to-one  correspondence  between  the  2n 
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users  and  the  2n  binary  codewords  of  length  n.  Then, 
each  encoded  user  Is  uniquely  identified  by  a  single 
binary  codeword  of  length  n.  But,  as  we  saw  above, 
each  such  codeword  also  identifies  a  single  leaf  on  the 
(n+i)  -  depth  binary  tree.  Thus,  there  exists  a  one- 
to-one  correspondence  between  those  tree  leaves  and  the 
2n  encoded  users.  Let  the  2n  users  be  encoded  by 
binary  sequences  of  length  n,  and  let  each  user  have  in 
its  memory  a  reproduction  of  the  (n+1)  -  depth  binary 
tree.  Each  user  considers  himself  placed  on  the  tree 
leaf  whose  codeword  coincides  with  his  own.  If  we 
number  the  2n  users  from  1  to  2n,  the  codeword  uf  the 
ith  user  is  the  length  n  binary  representation  of  the 
number  i-1.  From  now  on  we  will  identify  some  user 
either  by  his  number  or  by  hiw  codeword.  Let  t0  be 
some  time  Instant  such  that  all  collisions  involving 
packets  from  user  i  have  been  previously  resolved.  Let 
user  i  transmit  again  at  time  t,  where  t>tQ  and  the 
user  did  not  transmit  in  the  time  interval  (t0.t).  Let 
there  be  a  collision  in  slot  t.  User  i  observes  this 
collision  and  starts  the  motions  for  its  resolution.  In 
the  process,  he  uses  the  (n+1)  -  depth  binary  tree  in 
his  memory,  inspecting  continuously  the  feedback.  He 
traces  sequentially  neighboring  tree  nodes,  in  an  order 
dictated  by  the  feedback.  At  the  same  time,  he  imagines 
himself  placed  at  different  tree  nodes  depending  on  the 
relctlonsulp  between  the  current  node  in  the  trace  and 
the  user's  codeword.  The  user  transmits,  when  the  node 
he  has  placed  himself  at  coincides  with  the  trace  node. 
To  explain  the  collision  resolution  process  coherently, 
we  need  to  discriminate  between  node  tracing  ar.d  user 
node  self-placing.  To  do  that,  we  first  present  the 
following  definition. 

Definition  1 

At  some  point  in  time,  user  i  is  blocked  if  he  is 
in  the  process  of  resolving  a  collision.  The  blocked 
user  i  is  in  resolution  mode  2*:0<k<n  if  he  imagines 
himself  placed  at  a  node  that  lies  at  the  tree  depth 
n-k.  If  k<n,  and  if  xj^.-.Xj,  Is  the  user's  codeword, 
the  node's  codeword  is  Xj^.  •  xn-k‘  Thf-  user  Is  in 
k  k 

resolution  mode  2  and  active  (2*)  if  he  is  in  resolu¬ 
tion  mode  2^  and  transmits.  He  is  In  resolution  mode 
2  and  withholding  (2,,)  if  he  is  in  resolution  mode  2^ 
and  doe3  not  transmit.  The  blocked  user  i  is  in  state 
(2a).  yjL...yf)(2^,  v1...yf):£<n  if  he  is  in  resolution 
mode  2k,  his  node  tracing  has  reached  the  tree  node 
whose  binary  codeword  is  y^...y£,  and  he  is  respectively 
rctive  or  withholding. 


Let  x^...xn  be  the  codeword  of  user  i.  Let  the 
user  be  unblocked  at  time  t-1,  and  let  him  transmit  and 
be  blocked  at  time  t.  Then,  the  BCRLS  protocol  per¬ 
formed  by  the  user  is  described  by  the  following  state¬ 
ments. 

1.  At  time  t,  the  user  imagines  himself  placed  at  the 
root  of  the  tree  and  transmitting.  Observing  collision, 
he  also  starts  his  node-tracing  at  the  tree  root.  Thus, 
at  time  t  user  i  is  in  state  (2j|,R). 

2.  Let  at  time  t^  user  i  be  blocked  and  in  resolution 

mode  2k;0<k<n.  Then,  by  definition  1,  the  user  has 
placed  himself  at  node  •••*„_!,•  If  he  is  in  resolu¬ 
tion  mode  2a,  he  transmits  in  slot  tj.  If  he  i3  in 
resolution  mode  2*,  he  does  not.  He  cannot  be  in  reso¬ 
lution  mode  2a,  unless  his  node-tracing  has  reached 
node  Let  the  user  be  in  resolution  mode  2a 

at  time  tj_.  Then, 

i)  If  he  observes  success  in  slot  tj,  he  becomes 
unblocked . 

ii)  If  he  observes  collision  in  sicr  tj,  he  moves 
at  time  tj+1  to  resolution  mode : 


„k-l 


2a  ;  “  Vk+l  -  k  >  * 


,k-l 
2  ;  if  x 


n-k+1 


1,  and  k  >.  1 


2^  if  x^.  *  0,  and  k  *  0 

2n_1;  if  x,  -  1,  and  k  0 
w  1 


Let  the  user  be  in  resolution  mode  2{J  at  time  Then, 

i)  If  he  observes  a  collision  in  slot  t^»  he 
moves  at  time  tj+1  to  resolution  mode: 


2"_1;  if  k  -  0,  and  xx 


2n  *■;  if  k  *  0,  and  x,  *  1 
w  ’  ’  1 

I. 

Otherwise,  he  remains  in  resolution  mode  2ft* 

w 

ii)  If  he  observes  an  either  empty  or  a  success¬ 
ful  slot  tj,  he  mvoes  at  time  Cj+1  to  resolution 
mode : 

k-1 

2  ;  if  his  node-tracing  reaches  node 

xl"-*n-k+l  at  tlae  *1+1- 

V 

Otherwise,  he  remains  in  resolution  mode  2W. 

3.  Let  at  time  t^  user  i  be  blocked.  Let  y0yi*-*y£5 
0<f<n  be  the  node  reached  by  the  user's  node— tracing  at 
time  t^;  where  if  £» 0  the  node  is  the  tree  root.  Then, 


i)  If  the  user  observes  a  collision  at  t^,  he 
moves  at  time  tj+l  his  node-tracing  to  nooc: 

yi*..y£0;  if  l£-t<n-l 

0;  if  either  i-0  or  n 


ii)  If  the  user  is  in  withholding  resolution  mode 
St  time  tj,  and  observes  either  empty  or  successful 
slot  t^,  he  moves  at  time  tj+1  his  node-tracing  to 
node: 


yl'"Vl  1;  1-mzt 

;  where  m:yj*l;  m+l<J^£  and  ym“0,  for  •'’>1  and 
yi1 • -YI  3uch  that  not  all  bits  yj  are  equal  to  one. 

Hi)  If  the  user  is  in  active  resolution  mode  at 
time  tj,  and  observes  success,  he  becomes  un¬ 
blocked. 

Statements  1  to  3  above,  basically  describe  a  tree 
search  ar  in  [6).  The  difference  here  is  that  this 
search  is  not  performed  simultaneously  by  all  users. 
Thus,  it  is  possible  that  when  some  user's  search 
reaches  a  leaf  node,  a  collision  occurs.  Then,  as 
statements  1  to  3  indicate,  the  user  interprets  this 
collision  as  a  root  collision,  and  he  reinitiates  his 
tree  search. 

B.  Properties  of  the  BCRLS 

As  we  explained  previously,  the  BCRLS  tree  search 
is  not  performed  simultaneously  by  all  the  2n  users. 
Indeed,  each  user  initiates  his  own  BCRLS  when  he  be¬ 
comes  blocked.  Therefore,  at  some  point  in  time*  there 
may  be  some  blocked  users  at  various  levels  of  their 
tree  search,  and  some  unblocked  users.  The  unblocked 
users  do  not  inspect  the  feedback.  Among  the  blocked 
users,  there  will  be  some  in  active  resolution  mode  and 
some  in  withholding  resolution  mode.  Any  possible 
collision  will  be  caused  by  the  active  users,  and  will 
be  observed  by  all  the  blocked  usets.  Among  the  users 
who  are  blocked  and  in  withholding  mode,  we  will  single 
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1. 

out  those  who  are  either  in  state  (2y,  Xx*,,xn-k-l  0); 
CKk<n-l,  or  in  some  state  (2"  Xx,,-Xn-Ic-1  01...  1); 
l£k<n-l,  where  ^  are  t*le  ^rst  n-k  bits  of 

the  user's  codeword.  To  discriminate  between  those 
users  and  the  remaining  blocked  and  withholding  users, 
we  will  denote  the  resolution  mode  of  the  first  2£e. 

4s  shown  in  [18],  if  a  user  is  in  resolution  mode  2ue 
and  observes  an  either  emptv  or  a  successful  slot,  he 
moves  to  resolution  mode  2k.  Let  us  denote, 

({2£,Nk);0<k<n,{2£e,Pk};0<k<n-lt[M]):  The  event  that  at 
some  point  in  time  there  are  M  unblocked  users, 

Nk ; 0£k£  n  users  in  resolution  mode  2k,  and  P ^ ; (Xk<n-1 
users  in  resolution  mode  2^.  (1) 

We  now  present  a  proposition  whose  proof  is  in 

[18]. 

Preposition  1 

Given  2n  users,  consider  the  event  ({23,^}; 

0< k< n ,  {2£e,Pk}  ;(Kk<n-l,  [M] )  ,  at  some  point  in  time. 
Then,  for  every  k£n-l  such  that  (^>0,  the  first  n-k 
codeword  bits  of  the  users  are  identical;  thus  the 
users  branch  off  the  same  tree  node  at  depth  n-k. 

The  same  is  true  for  the  Pk  users,  if  Pk>0 .  Therefore, 
Njt<2*c,  and  Pi<£2k. 


Let  us  now  suppose  that  at  some  tine  instant  t, 
the  event  ({2^,Nk) ;0<k<n, {2£e ,Pk} ;0<k<n-l [M] )  occurs. 
For  some  k  such  tuat  0£k£n,  let  the  codewords  of  the  Nk 
users  have  the  common  prefix  Xj (k) . .  .Xj^fk)  ;  where  for 
k*n  this  prefix  is  the  tree  root.  Among  those  u«ers, 
let  us  have  with  common  codeword  prefix  Xx('.)... 
xn_k(k)0;  let  v.s  have  Nk2  with  common  codeword  prefix 
x^(k) .. .xn_k(k)l.  For  some  k  such  that  0£k£n-l,  let 
the  codewords  of  the  Pk  users  have  the  common  prefix 
yx(k) , .  .yn_k_i(k)l.  Then,  y^k) . .  .yn_k_x(k)l  cannot  be 
identical  to  Xj(k) . . .xn_k(k) .  If  P0+N0-2,  then  yx<0).. 
.y„_l(0)  and  Xx<0)  . .  ^.xC0)  are  identical.  This  last 
statement  evolves  from  proposition  1,  and  it  is  proved 
n  n 

in  [18].  If  53  N.*0,  slot  t  is  empty.  If  £  N.“l, 

k»0  K  k=0 

slot  t  is  a  successful  flot.  and  then  the  one  active 

n 

user  becomes  unblocked.  If  ^  N. _ 2,  s'ot  t  is  a 

k=0 

collision  slot.  'The  above  observations  lead  to  some 
simplifications  regarding  the  events  in  (1)  and  their 
transitions  in  time.  To  show  that,  let  us  first 
denote, 

<{2k,Nk};l<k<n,(Mj):  The  evc.it  such  that  at  some  point 
in  time  there  are  M  unblocked  users,  Nk;  l£k<n  users 
in  resolution  mode  2^,  and  no  users  in  resolution 
mode  2°.  (2) 

We  now  present  a  proposition  whose  proof  is  in 

[18]. 

v repos it ion  2 

Let  at  some  time  t  the  collision  event  ({2k,Nk}; 
l<k<n,  ]K])  occur.  Let  for  some  k:l£k<n  be  Nkj  users 
with  codeword  prefix  xx(k) . . .xn_k(k)0,  and  Nk2  users 
with  codeword  prefix  xx(i ) . . .xn_k(k) 1 .  Then,  none  of 
n 

the  users  and  the  users  that  are  in  withholding 

k=l  u 

mode  at  time  t  become  unblocked,  before  all  the  N  . 

k5i  ki 

users  do,  and  before  all  the  users  who  become  blocked 
in  the  mean  time  transmit  successfully. 


Let  us  now  denote, 


L({2k,Nk) ;l<k<n, [M]) :  the  expected  number  of  slots 
needed  for  the  resolution  pf  the  collision  re¬ 
presented  by  the  event  ({2k,Nk) ;l<k<n, [M ] )  in  '2), 
lust  after  this  collision  has  been  observed.  (3} 

From  the  derivations  in  [18],  we  obtain, 

.  n 

L({2*,N.  };l<k<n,[M])  -  J  T,  N.  >  2 
a  k  k-1  k 


2+L  ( {  2* ,  s+N12  } ,  {  2*"1 ,  Nk2)  ;  2<k<n ,  [ H+ £  «u"8 1 ) 

'  lf  ”  +  £  Nkl  1  1 
2+L({2",nH-M1},{2^'1,Nkl};  2<k<n,  I-l-m]) 


+L({2",s}>(2^~1,Nk2h  2<k<n,[M+N1  +  £  Nkl's)) 
n 

;  if  m  +  53  Nkl  >  2  (4) 

k«l 


Let  us  now  consider  the  Nk  users  who  are  in  reso¬ 
lution  mode  ?k.  Those  users  have  a  common  codeword 
prefix  Xx(k) . . .xn_k(k) ,  and  they  are  at  most  2k.  Let 
us  define  by  P(Nkx?Nk),  the  probability  that  given  the 
number  Nk,  there  are  Nkx  users  whose  codeword  prefix 


ic  xx(k)  .  ..x,.  ,k(k)0. 
trkl,  and  clearly. 


Then,  Nk2  will  be  eoual  to  Nk- 


P(Nkl/Nk> 


<  mm  (2k_1,Nk)  (5) 

III.  THE  BCRLS  PERFORMANCE 

As  we  saw  in  section  II,  if  at  some  time  instant 
t  the  event  ( {2k,Nk} ;l£k£n, (M|)  occurs,  then  M  denotes 
the  number  of  users  who  are  unblocked  at  t.  To  this 
point,  we  made  no  assumptions  as  to  the  transmission 
characteristics  of  rho.-.e  users.  Here,  we  will  assume, 
as  in  [6],  that  an  unblocked  user  transmits  with  prob¬ 
ability  q  per  slot.  Thus,  if  at  some  point  in  time 
the  number  of  unblocked  users  is  M,  then  the  proba¬ 
bility  Q(m,M)  that  m  users  will  transmit  is  given  by 
the  following  expression. 

Q(m,M)  -  qm(l-q)M~“;0<m<M  (6) 

The  implication  behind  the  expression  in  (6)  is 
that  the  unblocked  users  transmit  independently.  This 
is  consistent  with  the  independence  assumption  made  at 
the  beginning  of  this  paper.  Using  expressions  (4), 
(5),  and  (6),  we  can  express  an  equation  for  the 
expected  value  L( {2k,Nk} ;l<k<n, [M] ) .  This  equation  is 
given  *n  [18],  it  relates  the  expected  values  of 
different  events  as  in  (2),  and  it  determines  a  linear 
system  of  equations  whose  solution  is  the  set  of 
expected  vai.'os  as  in  (3). 


Jlef ■  lltlon  2 

Given  2n  users,  the  throughput  2nqn  of  the  BCRLS 
protocol  is  such  that  every "q  less  than  qn  provides  a 
bounded  and  nonnegative  solution  for  the  linear  system 
in  [18],  and  no  q  value  larger  than  qn  does. 


For  given  q  value,  the  solution  of  the  linear 
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system  in  (18]  can  be  obtained  numerically.  The 
throughput  2nqn  can  be  also  found  numerically  through 
the  trials  of  different  q  values.  We  will  present  such 
numerical  results  in  section  IV.  Let  us  define, 

D(2n,q);  The  expected  number  of  slots  for  the  resolution 
of  some  collision  at  t+1,  given  2n  users,  given  that 
at  t  all  users  are  unblocked,  and  given  that  each 
unblocked  user  transmits  with  probability  q. 

The  quantity  D(2n,q)  above  is  parallel  to  the 
expected  delay  E{delay]  in  (6),  and  it  is  clearly  given 
by  the  following  expression. 

2«  n  n 

0(2n,q>l  +  2  qN(l-q)2  'NL({21’  N},[2n-N])  (7) 

N-2  N  a 

For  2n  users,  the  stability  region  of  the  BCRLS 
consists  of  those  2nq  values  that  provide  a  bounded  and 
nonnegative  solution  for  the  linear  system  in  [18].  For 
such  2nq  values,  the  lengths  L({  2",N} ,  [ 2n— K] ) ; 2<_N£2n 
will  be  finite;  thus  collisions  that  are  signified  by 
events  ({2jJ,N} ,  [2n-N])  will  end  in  finite  time.  Fur¬ 
thermore,  due  to  proposition  2,  when  such  sollisions 
end,  all  the  users  will  be  unblocked.  In  conclusions, 
the  quantity  D(2n,q)  provides  the  expected  length  of 
independent  eplrodes,  and  D(2n,q)-1  bounds  from  above 
the  expected  per  packet  delays  (measured  in  slots). 

IV.  NUMERICAL  RESULTS  AND  0CNCLUS10NS 

Due  to  lack  of  space,  we  exhibit  here  numerical 
results,  without  discussion.  In  table  IV. 1,  lower 
bounds  on  the  BCRLS  throughput  are  listed.  In  figure 
IV. 1,  upper  bounds  on  transmission  delays  are  included. 
In  table  IV. 3,  lower  bounds  2nq*n  on  the  malntable  by 
the  BCRLS  arrival  rates  are  exhibited,  and  compared 
with  the  corresponding  such  rates  T„  and  T„  in  [6],  for 
the  nondynamic  and  the  dynamic  tree  search  respectively. 
Details  on  the  above  aspects  can  be  found  in  (IF], 


#  of  Users 

~n  * 

2  •«» 

23;n-3 

.47192 

2*;n«4 

.44336 

2^;n*5 

.43006 

2^;n”6 

.42368 

n  -*  « 

.36 

Table  IV. 1 

Lower  Bounds  on  the  BCRLS  Throughput 


t  of  Users 

2nq*_ 

T 

d 

T 

Han 

n 

n 

23;n«3 

.725 

.... 

.... 

2^;n«4 

.630 

.... 

.... 

2S;n-5 

.560 

.... 

.... 

2^;n»6 

.515 

.420 

.675 

n  oo 

.36 

.346 

.429 

Table  IV. 3 

Lower  Bounds  on  the  Maintainable  by  the  BCRLS 
Input  Rates 
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SESSION  1:  SOON  10-250: 

Chalrain:  Protestor  Michael  Athene,  MIT 
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9:00  A.M. 

REGISTRATION 

9:00  - 

9:30  A.M. 
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Professor  Michael  Athene,  MIT 

Dr.  Charles  J.  Boll  nd,  ONR 
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ISSUES  OF  STRATEGIC  C*  I 
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Chatman:  Professor  Robert  R.  Tenney,  MIT 
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2:30  -  3:00  P.M.  THE  USE  OF  RECIPROCAL  POLAR  CO-ORDINATES  IN  PASSING 

TRACKING 

A.  Daaaford,  SCICON,  Ltd. 

3:00  -  3:30  P.M.  MANEUVERING  FOR  BEARINGS -ONLY  ESTIMATION 

G.  Z.  Vilholt ,  U.  S.  Navy 
R.  R.  Tenney,  MIT 
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3:30  -  4:00  P.M 


BREAK 


4:00  -  4:30  P.M.  GENERALIZED  TRACKER/ CLASSIFER  (CTC)  -  A  SYSTEM  FOR 

TRACKING  AND  CLASSIFICATION  OP  MULTIPLE  TARGETS 

S.  Mori,  C,  Y.  Chong,  and  R.  P.  Viahaar 
Adraacad  Inform tlo-  and  Daelaloa  Syitiat 

B.  Taa,  Staaford  UalTtralty 


4:30  -  3:00  P.M.  OPTIMAL  MANEUVER  DETECTION  AND  ESTIMATION  IN 

MULTIOBJECT  TRACKING 

T.  Kartaa  aad  A.  Bllta,  ALPHATECH,  lac. 


3:00  -  3:30  P.M.  A  FRAMEWORK  FOR  SURVEILLANCE  ALGORITHM  EVALUATION 

J.  Valaa,  Drapar  Laboratory 
I.  R.  Tannay,  MIT 


MONDAY  AFTERNOON,  JULY  23,  1983 

SESSION  3:  ROOM  10-230:  C*  SYSTEM  EVALUATION 

Chalraaa:  Dr.  Alazaadar  H.  Lavla,  MIT 

1:30  -  2:00  P.M.  REQUIREMENTS  FOR  TEB  NAVY  TACTICAL  C*  SYSTEM 

L.  S.  Patara,  SRI  Iataraatioaal 

2:00  -  2:30  P.M.  A  LARGE-SCALE  CONTROL  SYSTEM  ANALYSIS  METHODOLOGY 

D.  R.  Frtadaaa,  Tha  MITRE  Corp. 

2:30  -  3:00  P.M.  TECHNIQUES  FOR  DETECTING  COVER  AND  DECEPTION 

R.  Garaas,  BETAC,  Corp. 

3:00  -  3:30  P.M.  A  COMPARISON  OF  DEFENSE  C*  MODELS 

J.R.  Dowd  la,  M.P.  Marrlmn,  R.  F.  Oaadroa,  aad 
L.C.  Kxaaar 
ALPHATECH,  Iae. 


3:30-4:00  P.M. 

4:00  -  4:30  P.M. 

4:30  -  3:00  P.M. 

3:00  -  3:30  P.M. 
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TUESDAY  MO IN ING,  JULY  26,  1963 

SESSION  4;  BOOM  10-250 

Chi  train:  Profeeeor  Robert  R.  Tenney 

1:30  -  9:00  A.M.  REGISTRATION 

9:00  -  9:30  A.M.  SURVEILLANCE  RESEARCH  AT  MIT 

R.  R.  Tenny.  MIT 

9:30  -  10:00  A.M.  DISTRIBUTED  ESTIMATION  IN  THE  MIT/LL  DSN  TEST-BED 

J.  R.  Deleney,  R.  T.  Leeoee,  P.  E.  Green 
Lineoln  Labor* tory 

10:00  -  10:30  A.M.  DECISIONMAKING:  A  COMMANDER’S  CONSIDERATION 
W.  Meyer*.  U.  S.  Navy  (Rat.  Adairal) 

10:30  -  11:00  A.M.  BREAK 

11:00  -  11:30  A.M.  ARTILLERY  CONTROL  ENVIRONMENT 

S.  Wolff,  Ball  let le  Raaearch  Laboratory 

11:30  -  12:00  P.M.  COMMAND  DECISION  MAKERS  AND  THEIR  MODES  OF 
INTERACTION 

P.  D.  Morgan ,  SCICON.  Ltd. 

12:00  -  1:30  P.M.  LUNCH  BREAK 

TUESDAY  AFTERNOON,  Jnly  26,  19S3 

SESSION  3:  ROOM  10-105:  DISTRIBUTED  ESTIMATION  AND  DATA  FUSION 

Chalrnen:  Profeeeor  Robert  R.  Tenney,  MIT 

1:30  -  2:00  P.M.  MULTI-SENSOR  FUSION  IN  A  MULTI-TARGET  ENVIRONMENT 

R.  N.  Lobbia  and  D.  L.  Alepach,  ORINCON 

2:00  -2:30  P.M.  DISTRIBUTED  ESTIMATION  SYSTEMS 

C.  T.  Chong,  S.  Mori,  Advanced  Inforaetion  anJ 

Deeiaion  Syeteae 

E.  Tee,  Stanford  Univerelty 

2:30  -  3:00  P.M.  DISTRIBUTED  ESTIMATION  WITH  COMMUNICATION  DELAY 

A.  Oxbek  and  R.  R.  Tenney,  MIT 

3:00  -  3:30  P.M.  TRACK  ASSOCIATION  ALGORITHMS 

A.  Baaford,  SCICON,  Ltd. 

3:30  -  4:00  P.M.  BREAK 
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4:00  -  4:30  P.M.  INCA:  AN  ENVIRONMENT  FOR  EXPLORATORY  DEVELOPMENT  OF 

TACTICAL  DATA  FUSION  TECHNIQUES 

L.  S.  Grot*  and  H.  J.  Pijrna,  VERAC,  Inc . 

4:30  -  3:00  P.M.  A  UNIFIED  APPROACH  TO  MODELING  AND  COMBINING  OF 

EVIDENCE  THROUGH  RANDOM  SET  THEORY 

I.  R.  Goodaan,  NOSC 

3:00  -  3:30  P.M.  EXPERT  SYSTEMS  APPLIED  TO  AUTOMATED  MILITARY 

INTELLIGENCE 

M.  B.  Woable,  Lockheed  Miaatla  aad  Space 

TUESDAY  AFTERNOON.  July  26,  1983 

SESSION  <:  ROOM  10-230  :  ORGANIZATION  MODELS  AND  DESIGN 
Chal nun:  Dr.  Alexander  H.  Levis,  MIT 

1:30  -  2:00  P.M.  DISTRIBUTED  OPTIMIZATION  ALGORITHMS  WITH 

COMMUNICATIONS 

J.  N.  Tsitsiklis  and  M.  Athana.  MIT 

2:00  -  2:30  P.M.  A  LAYERED  MOCEL  FOR  COMMAND-CONTROL 

W.  F.  Orleee,  Hazel  tine  Corp. 

2:30  -  3:00  P.M.  MODELING  THE  ASW  TACTICAL  C*  DECISION  PROCESS 

M.  Alexandridla,  3.  Deokert.  E.  Batin,  and  J.  G.  Wohl 
ALPHATBCH,  lao. 

3:00  -  3:30  P.M.  THE  DESIGN  OF  INFORMATION  STRUCTURES: 

BASIC  ALLOCATION  STRATEGIES  FOR  ORGANIZATIONS 

D.  A.  Stabile,  ICF,  Inc. 

A.  H.  Levie,  MIT 

3:30  -  4:00  P.M.  BREAM 

4:00  -  4:30  P.M.  MESSAGE  STANDARDS  IN  SOFTWARE  ENGINEERING 

J .  V.  Bronson,  Col.  U.  S.  Marine  Corpa 

4:30  -  3:00  P.M.  MULTI -OBJECTIVE  DECISIONMAKING, INCOMPLETE  INFORMATION 

AND  COORDINATION  PROBLEM  IN  C*  SYSTEMS 

K.  Loparo  and  M.  Meaarovie 
Caae  Western  Reserve  University 


WEDNESDAY  MORNING,  JOLT  26.  1983 


SESSION  7:  BOOM  10-25" 

arint»:  Prof i»«or  kobart  B.  Taaaap.  MIT 

8:30  -  9:00  A.M.  REGISTRATION 

9:00  -  9:30  A.M.  COMMAND  AND  CONTROL  ORGANIZATIONS  FOR  NAVAL  BATTLE 

FORCES 

M.  Athipa,  MIT 

9:30  -  10:00  A.M.  INDIVIDUAL  DIFFERENCES  IN  MILITARZ  DECISIONMAKING: 

THE  CLASSIFICATION  PERFORMANCE  OF  ACTIVE  SONAR 
OPERATORS 

J.  G.  *ohl.  ALPHATECH,  lac. 

10:00  -  10:30  A.M.  DECISION  AND  DISPLAI  ANALYSIS  IN  A  SIMPLE 
SURVEILLANCE  PROBLEM 

R.  L.  Harihata,  F.  L.  Graitxar,  NPRDC 

10:30  -  11:00  A.M.  BREAM 

11:00  -  11:30  A.M.  DISTRIBUTED  ASYNCHRONOUS  ALGORITHMS 

D.  P.  Bartaakia,  MIT 

11:30  -  12:00  P.M.  HYBRID  ROUTING  IN  COMMUNICATION  NETWORKS 

A.  Sagall ,  J.  M.  Jaffa  aad  F.  H.  Mo a a 
IBM  Tkoaaa  I.  Ration  Raiairch  Caatar 

12:00  -  1:30  P.M.  LUNCH  BREAK 

WEDNESDAY  AFTERNOON.  JULY  2?.  1983 
SESSION  8:  ROOM  10-105:  COMMUNICATIONS 
Chi 1  naan:  Profataor  Robart  R.  Tana ay,  MIT 

1:30  -  2:00  P.M.  INTERSHIP  SENSOR  ALIGNMENT  USING  UHF  TACTICAL  DATA 

LINE  RELATIVE  NAVIGATION 

R.  H.  Ovartoa.  Coapttk  Raaaareb.  Iae. 

2:00  -  2:30  P.M.  COMMUNICATIONS  SUPPORT  OF  DISTRIBUTED  COMMAND  AND 

CONTROL 

G.  A.  Clapp.  NOSC 

2:30  -  3:00  P.M.  TEE  USE  OF  SUMMARY  DISPLAY  TRANSFERS  FOR  COORDINATION 

BETWEEN  COMMAND  FACILITIES  FOR  THE  COMPOSITE  WARFARE 
COMMANDER  AND  THE  ANTI-AIR  WARFARE  COMMANDER  OF  A 
NAVY  BATTLE  GROUP 

C.  J.  Graat,  Tka  Jobaa  Bopkiaa  Univaralt) 
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3:00  -  3:30  P.N.  BREAK 

3:30  -  4:00  P.M.  DATA  BASE  REDUNDANCY  IN  VULNERABLE  COMMUNICATION 

NETWORKS 

N.  Ik.  tad  N.  A  than*.  NIT 

4:00  -  4:30  P.N.  A  COLLISION  RESOLUTION  PROTOCOL  WITH  UNITED  CHANNEL 

SENSING  FINITELY  WANT  USERS 

P.  Ptptatoai-Ktitkot,  G.  D.  Ntrens,  N.  Gtorgiopoulot 
Uaiverilty  of  Coaatetieut 

4:30  -  3:00  P.N.  OPEN 

3:00  -  6:00  P.N.  VINE  AND  (SEESB  (BUSH  BOON  10-103) 

WEDNESDAY  AFTERNOON,  JULY  27.  19S3 

SESSION  9:  BOON  10-230:  HUNAN  DECISION  MODELS 

Chtlrata:  Dr.  AXtztndtr  H.  Ltwit,  NIT 

1:30  -  2:00  P.N.  HUNAN  DECI  SION  MAKING  IN  BTNANIC  ENVIRONMENT  WITH 

INCREASING  INFORMATION  PROCESSING  DEMANDS 

D.  Serfity.  UaiTtriity  of  Coaatotteat 

2:00  -  2:30  P.N.  C*I  SYSTEM  DESIGN  AND  DEVELOPMENT:  INSIGHT  FROM  THE 

BEHAVIORAL  SCIENCE  PERSPECTIVE 

D.  L.  Finley,  Army  Retetrek  Inetitate 
V.  P.  Cherry,  Vector  Retetrah,  Iao. 

2:30  -  3:00  P.N.  MODELS  OF  INFORMATION  STORAGE  AND  MEMORY  IN  C* 

SYSTEMS 

S.  A.  Bill,  ECA.  Ibv. 

A.  H.  Levi*.  MIT 

3:00  -  3:30  P.N.  BREAK 

3:30  -  4:00  P.N.  THE  COGNITIVE  ORGANIZATION  OF  SUB  KARINE  SONAR 

INFORMATION:  A  MULTIDIMENSIONAL  SCAUNG  ANALYSIS 

K.  Lexer,  G,  Moeller,  tad  V.  Roger* 

Nival  Sabatrlae  Medlotl  Retetrek  Laboratory 

4:00  -  4:30  P.N.  THE  EFFECTS  OF  SPATIAL  INFORMATION  DISPLAYS  ON 

DECI ION  MAKING  PERFORMANCE  IN  TACTICAL  C*  SYSTEMS 

B.  D,  Seott  tad  C.  D.  Wicken* 

Unirereity  of  Illlaoit 

4:30  -  3:00  P.N.  DEVELOPMENT  OF  A  GENERALIZED  HUNAN-MACHINE  INTERFACE 

R.  B.  Kaoz,  Lodheed  Electronic*  Co..  Iao. 

3:00  -  6:00  P.N.  VINE  AND  CHEESE  (Bath  Rooa  10-103) 


THURSDAY  MORNING,  JOLT  28.  1983 

SESSION  10:  ROOM  10-250 

Chatrnan:  Dr.  Alexander  R.  Laris,  MIT 

8:30  -  9:00  A.M.  REGISTRATION 

9:00  -  9:30  A.M.  PROBLEMS  IN  THE  ANALYSIS  AND  EVALUATION  OF 

INFORMATION  PROCESSING  AND  DECISIONMAKING 
OR'  NIZATIONS 

A.  H.  Lewis,  MIT 

9:30  -  10:00  A.M.  DOING  C*  EXPERIMENTS  WITH  WAR  GAMES 

J.  S.  Lawson,  Jr.,  NAVELEX 

10:00  -  10:30  A.M.  DESIGN  OF  A  SYSTEM  EXECUTIVE  FOR  THE  MANAGER) OT  OF 
COMMAND/ CONTROL  FACILITIES 

E.  R.  Docot,  MIT 

10:10  -  11:00  A.M.  BREAK 

11:00  -  11:30  A.M.  A  KNOWLEDGE  BASED,  INTERACTIVE  PROCEDURE  FOR  PLANNING 
AND  DECISION  SUPPORT  UNDE'  UNCERTAINTY  AND  PARAMETER 
IMPRECISION 

A.  P.  Sags  and  C.  C.  Whit*.  Ill 
Uniwars Itjr  of  Virginia 

11:30  -  12:00  P.M.  DATA  INTEGRATION  FOR  COMMAND  AND  CONTROL  SUPPORT 
SYSTEMS 

J.  C.  Machado,  NAVELEX 

12:00  -  1:30  P.M.  LUNCH  BREAK 

THURSDAY  AFTERNNON,  JULY  28,  1983 

SESSION  11:  ROOM  10-105:  THEORETICAL  ADVANCES 

Oiairaan:  Professor  Robert  R.  Tenney,  MIT 

1:30  -  2:00  P.M.  A  MINIMUM  SENSITIVITY  INCENTIVE  CONTROL  APPROACH  TO 

TEAM  PROBLEMS 

T.  Baaar,  J.  B.  Crnx,  Jr.,  and  D.  Canaewar 
University  of  Illinois 

2:00  -  2:30  P.M.  SUBJECTIVE  GAMES  OF  INCOMPLETE  INFORMATION 

D.  Teioketxi*  and  D.  Castanon,  ALPHATECH,  Inc, 

2:30  -  3:00  P.M.  OPTIMAL  STRUCTURES  IN  DYNAMIC  TEAM  PROBLEMS 

R.  R.  Tenney,  MIT 
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3:00  -  3:30  P.M 


ON  THE  COMPLEXITY  OF  DISTRIBUTED  DECISION  PROBLEMS 

J.  Tsitsiklis  and  M.  Athens,  M7T 

3:30  -  4:00  P.M.  BREAK 

4:00  -  4:30  P.M.  ON  THE  ASSIGNMENT  OF  LEADERSHIP  IN  STACXELBERG  GAMES 

K.  Lopa.-o,  Y.  Choe,  and  S.  Kahne 
C.se  Western  Reserve  University 

4:30  -  5:00  P.M.  DYNAMIC,  HIERARCHICAL  DECISION  PROBLEMS 

P.  B.  Lull,  and  T.  S.  Chang,  University  of  Connecticut 

5:00  -  5:30  P.M.  ON  THE  CONTROL  OF  SYSTEMS  WITH  ABRUPTLY  CHANGING 

STRUCTURE 

B.  E,  Griffiths  and  K.  A.  Loparo 
Caae  Western  Reserve  University 

THURSDAY  AFTERNNON,  JULY  28,  1983 
SESSION  12:  ROOM  10-250:  DECISION  AIDS 
Chairman:  Ms.  Elizabeth  R.  Ducot,  HIT 

1:30  -  2:00  P.M.  A  MAN- MACHINE  INTERFACE  CONCEPT  FOR  A  STATE-0 F- ART, 

PROGRAMMABLE ,  SHIPBOARD,  COMMAND/ CONTROL  CONSOLE 

G.  Osga  and  R.  Fleming,  NOSC 

2:00  -  2:30  P.M.  TACTICAL  DECISION  AIDS  WHICH  QUANTIFY  JUDGEIinNT 

V.  Drown,  Decision  Science  Consortium,  Inc. 

2:30  -  3:00  P.M.  EVIDENTIAL  REASONING  FOR  SITUATION  ASSESSMENT 

T.  D.  Garvey  and  J.  D.  Lowrance,  SRI  International 

3:00  -  3:30  P.M.  AN  EVALUATION  OF  ARIADNE 

C.  C.  White,  IZI  and  A.  P.  Sage 
University  of  Virginia 

3.30  -  4:00  P.M.  BREAK 

4:00  -  4:30  P.M.  PERSONALIZED  AND  PF.RSCRITPIVE  DECISION  AIDS 

M.  S.  Cohen,  MAXIMA  Corp. 

4:30  -  5 : VO  P.M.  SIM:  A  SMART  INTERFACE  TO  A  MODEL 

A.  L.  Blitz,  R.  R.  Tenney,  L.  C.  Kramer 
ALP HA  TECH ,  Inc. 

5 : 00  -  5:30  P.M.  ON  THE  NEED  FOR  WELL  STRUCTURED  C*  EXPERIMENTS 

F.  C.  Deckelmsn,  NAVELEX 
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